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IT) (54) Title: GENERATING, VIEWING, INTERPRETING, AND UTILIZING A QUANTITATIVE DATABASE OF METABO- 
O LITES 

^ ( 57 ) Abstract: This disclosure provides methods for the creation of a quantitative database of metabolites, particularly lipid metabo- 
lites, using chromatographic technology ; methods for assembling that information into a visual format for interpretation, and methods 
Q of this information to identify and understand metabolome-wide effects, for instance those effects influenced by pharmaceuticals, 
^ genes, toxins, diet or the environment. Also provided are metabolite databases, such as lipid metabolite databases, that are stored on 
^ a computer readable medium, which include quantitative measurements of a plurality of metabolites. 
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GENERAT1NG, VIEWING, INTERPRETING, AND UTILIZING 
A QUANTITATIVE DATABASE OF METABOLITES 

FIELD 

5 This disclosure relates to ways of quantifying metabolites and collecting quantitative data on 

metabolites, a database of quantified metabolite profiles, and methods of mining and visualizing 
selected subsets thereof. 

BACKGROUND 

10 The recent explosion of data acquisition and analysis technology, termed informatics, 

promises to revolutionize predictive and diagnostic medicine. The information readily available to 
doctors and scientists today dwarfs that of even a few years ago, and will expand at an even more 
accelerated rate in the next few years. Managing this information and applying it to useful purpose 
are formidable challenges. 

15 Currently, genomics is the most developed and recognized form of biological informatics. 

Genomics developed to simultaneously identify the elements of heredity and to assign biological 
function to these elements. Despite the inherent complexity of the genome, the invention of just a 
few molecular tools enabled genomics to flourish into the science known today. In the near future, it 
is likely that most common genetic diseases will have been identified, many using genomic tools. 

20 The power of the knowledge emerging from the genome is that identifying the genetic basis of an 
inherited disease can provide logical strategies to treat those afflicted on an individual basis. 
However, genomics is not a panacea for predictive medicine because phenotype is not necessarily 
predicted by genotype. Beyond its application to diseases with demonstrably genetic causes, 
however, the direct utility of genomics by itself diminishes. 

25 Ultimately, changes in phenotype and not changes in genes (genotype) are of direct interest 

to nutrition and health. The gap between genotype and phenotype is spanned by many biochemical 
steps, each with individual specificities and a sensitivity to various influences, including diet and the 
environment. In the chain of biomolecules from genes to phenotype, metabolites are the quantifiable 
molecules with the closest link to phenotype. Many phenotypic and genotypic states are 

30 characterized or predicted by differences in the concentration of metabolites within biological tissues 
or fluids. For example, the progression of coronary artery disease can be predicted by the serum 
concentration of cholesterol and the presence of non-insulin dependant diabetes is characterized by 
elevated plasma free fatty acids. 

Metabolite informatics, or metabolomics, represents a more logical approach than genomics 

35 for identifying trends or metabolic profiles of specific diseases. While the assessment of disease in 
man has been pursued using individual metabolite assessments, there are no technologies that enable 
the accumulation of diverse metabolome data in a single seamless and expandable resource. Such a 
resource would allow global metabolic effects of disparate affectors to be compared and contrasted. 
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Data for such a resource would need to be quantitative so that data from many investigators, 
analytical technologies, and sample matrices could be integrated and compared. A quantitative 
database of metabolites containing samples from systems treated with many affectors or expressing 
many phenotypic or genotypic traits could be used to identify the molecular mechanisms consistent 
5 and divergent across many biological systems and individual samples and sample collections. 

Early attempts to use a metabolomic strategy for investigating phenotype have proven 
valuable across a broad spectrum of biological research. In microbiology, changes in metabolite 
profiles were used to describe the global metabolic response and variable glucose metabolism of E. 
coli under different growth conditions (Tweeddale etalj. Bacteriology 180:5109-5116, 1998). 

10 Metabolome analyses were also used to identify the global changes in E. coli metabolism caused by 
changes in population density (Liu etal., J. Bacteriology 182:4158-4164, 2000). Raamsdonk et al 
(Nature Biotechnology 19:45-50, 2001) used metabolomic analyses of yeast to identify the metabolic 
function of deleted genes for which there was no observable phenotypic consequence of their 
deletion. Using metabolomics to identify the function of genes demonstrates the versatility and 

15 power of metabolomics. Unlike genomics and proteomics, metabolomics can be used to identify 
changes that occur at all levels of biology from genes to environment. The direct results of 
nutritional, genomic or expression differences can be observed in a metabolite profile. This strategy 
is also widely accepted in plant research as a method for screening for desirable traits, and for 
understanding the phenotypic expression of genes (Fiehn et al, Nature Biotechnology 18:1 157-1 161, 

20 2000; Glassbrook et al, Nature Biotechnology 18:1 157-1 161, 2000). 

What is needed is a system for creating a quantitative bioinformatic database of metabolites, 
such as lipid metabolites, suitable for integrative research and valid comparative studies across many 
disciplines and sample systems. Further, there is a need to develop easy, understandable tools for 
mining, visualizing and interpreting this bioinformatic resource. Technologies are needed that can 

25 create and interact with accessible annotated databases of metabolite concentrations reflective of 
individuals in various phenotypic states. 



SUMMARY OF THE DISCLOSURE 

This disclosure provides methods for generating and storing quantitative metabolome data, 
30 particularly lipid metabolome data, in a way that is infinitely expandable and thus suitable for 

creating a quantitative database of metabolites. Further, this disclosure provides methods for mining 
this database with visual tools, including computer-mediated user interfaces, to discover relations 
among metabolites from different subsets of the database. 

Particular examples provided herein relate to methods of generating, assembling, organizing, 
35 mining, analyzing, and displaying lipid metabolomic (lipomic) data. 

The power and accuracy of predictive diagnostics stand to improve dramatically as a result 
of lipid metabolomics. The high definition of data obtained with this approach allows multiple rather 
than single metabolites to be used in markers for a group. Because as many as forty fatty acids are 
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quantified from each lipid class, and up to fifteen lipid classes can be quantified easily, more than six 
hundred individual lipid metabolites can be measured routinely for each sample. Because these 
analyses are comprehensive, only the most appropriate and unique metabolites are selected for their 
predictive value. Thus, comprehensive lipid analysis promises to greatly improve predictive 
5 diagnostics for phenotypes that directly or peripherally involve lipids. 

Also provided herein are databases and computer systems for storing, accumulating, sorting, 
selecting, and analyzing metabolomic data. 

Further provided embodiments are internal standards and internal standard compositions, 
particularly internal standards and internal standard compositions useful for analysis of lipids in 
10 biological samples. 

BRIEF DESCRIPTION OF THE FIGURES 
Figure 1 is a schematic overview of metabolomic analysis as provided herein. Quantitative 

analysis is used to measure a plurality of metabolites from a sample; the raw data produced by such 
15 analysis is optionally subjected to one or more transformations (e.g., computer calculations), 

including for instance integration of the area under a chromatogram curve with or without correction. 

Raw data and/or transformed data are entered into a database of results. In certain of the provided 

embodiments, a quality control mechanism compares the entered data against existing data in the 

database and identifies aberrant or erroneous data, which may lead to re-testing or repeated analysis. 
20 The database can be queried, for instance using filters or other discrimination mechanisms, and 

subsets of data that fit the query displayed. Such displays may be in any format, for instance in 

statistical or graphical formats as provided herein. 

Figure 2 is a schematic representation of certain lipid metabolite analysis embodiments. 

Chromatographic data is entered into a database, which can be mined for desired information and 
25 presented in the form of a graphical interface (for instance a heat map or targeting chart, as shown in 

the second panel). Such interfaces may be optionally provided in interactive form on a computer 

system, or remotely across, for instance, the Internet or another computer communication system. 

Data mined from the cumulative lipid metabolite database can be used, for instance, for clinical or 

diagnostic testing for a propensity to obesity), or to identify specific metabolic targets of drugs, 
30 as described in more detail herein. 

Figure 3 is a diagram showing an overview of the pathways of de novo fatty acid 

metabolism in humans. 

Figure 4 is a set of chromatograms of the indicated samples, produced by gas 

chromatography. FIG 4 A shows the gas chromatogram of a sample of Menhaden oil. FIG 4B shows 
35 a mirrored chromatogram, in which the Menhaden oil chromatogram is displayed top to bottom with 

a control chromatogram that contains standard compounds for comparison (labeled "Standard 

Sample"). Major peaks are identified as indicated. 
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Figure 5 is a representative "heat map" display of lipomic data, illustrating effects of 
rosiglitazone treatment on individual lipid metabolites. The concentration (expressed in nmol/g 
sample) of each lipid metabolite from treated and untreated mice was used to generate a heat map. 
The tissue and lipid class of each sample is indicated in the row headers (left). The fatty acid or 
5 sterol is indicated in the column headers (top). Color coding indicates the percentage difference 
between a control sample and the test sample, as explained below and in Example 1. The column 
headers represent an individual fatty acid present in the lipid classes, which are displayed on the left. 
The magnitude of the difference, expressed as a percentage change in the quantitative data between 
treated and untreated mice, is represented by color according to the legend. Differences not meeting 
10 a P < 0.05 are displayed in black. 

Summary data is presented in the smaller chart to the right, and includes nM of each fatty 
acid for each tissue: (1) total fatty acids, (2) saturated fatty acids, (3) mono-unsaturated fatty acids, 
(4) poly-unsaturated fatty acids, (5) n3, (6) n6, (7) n7, (8) n9 unsaturated fatty acids, and (9) 
plasmalogens ("dm"). 

15 Figure 5* is a duplicate of Figure 5, but is printed in grey-tones rather than in color. 

Figure 6 is a representative "targeting chart" display of lipomic data. This chart shows the 
different degrees of lipid metabolite changes when an animal is treated with CL3 16,243, a p-3 
adenergenic agonist versus rosiglitazone, a thiazolidinedione. 

Figure 6' is a duplicate of Figure 6, but is printed in grey- tones rather than in color. 

20 Figure 7 shows an example of a heat map indicating that rosiglitazone treatment exerts 

strong and tissue-specific effects on lipid class metabolism. The concentration (expressed in nmol/g 
sample) of each lipid metabolite from treated and untreated mice was used to generate the summary 
data displayed here as a heat map. The first column displays the quantitative difference in the 
concentration of each lipid class between the groups. The next columns, in order, describe the 

25 quantitative difference in the concentration of saturated fatty acids, monounsaturated fatty acids, 
polyunsaturated fatty acids, n3 fatty acids, n6 fatty acids, n7 fatty acids, n9 fatty acids, and 
plasmalogen lipids among the groups. The magnitude of the difference, expressed as a percentage 
change in the quantitative data between treated and untreated mice, is represented by color. 

Figure T is a duplicate of Figure 7, but is printed in grey-tones rather than in color. 



30 



DETAILED DESCRIPTION 



Abbreviation 



CDP-DAG: CDP-diacylglycerol 

CE: cholesterol ester 

35 CL: cardiolipin 

DAG: diacylglycerides 

FAME: fatty acid methyl ester 

FFA: free fatty acid 

LMP: lipid metabolite profile 

40 LY: lyso-phosphatidylcholine 
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LyCL: 


lysocardiolipin 


LyPE: 


lysophosphatidylethanolamine 


MAG: 


monoacylglycerides 


PA: 


phosphatide acid 


PC: 


phosphotidylcholine 


PE: 


phosphatidylethanolamine 


PG: 


phosphatidylglycerol 


PI: 


phosphotidylinositol 


PS: 


phosphotidylserine 


PS/I: 


phosphotidylinositol / phosphotidylserine 


SP: 


sphingomyelin 


TAG: 


triacylglycerol 



//. Explanation of Certain Terms 

15 Unless otherwise noted, technical terms are used according to conventional usage. 

Definitions of common terms in molecular biology may be found in Benjamin Lewin, Genes V, 
published by Oxford University Press, 1994 (ISBN 0-19-854287-9); Kendrew et al (eds.), The 
Encyclopedia of Molecular Biology, published by Blackwell Science Ltd., 1994 (ISBN 0-632-02182- 
9); and Robert A. Meyers (ed.), Molecular Biology and Biotechnology: a Comprehensive Desk 

20 Reference, published by VCH Publishers, Inc., 1995 (ISBN 1-56081-569-8). 

In order to facilitate review of the various embodiments, the following explanations of 
certain terms are provided: 

Biological Sample: Any biological material, such as a cell, a collection of cells (e.g., 
cultured cells), a tissue sample, a biopsy, or an organism. Biological samples also include blood and 

25 blood products (e.g., plasma) and other biological fluids (e.g., tears, sweat, saliva and related fluids, 
urine, tears, mucous, and so forth). Tissue samples can be from any organ or tissue in the body, 
include heart, liver, muscle, adipose, brain, lung, testes, and brain. 

Biological samples may be from individual subjects (e.g., animals, such as humans, mice, 
rats, monkeys, chickens, cats, dogs, pigs, horses, cows, fruit flies, or worms) and/or archival 

30 repositories. The samples may be acquired directly from the individuals, from clinicians (for 

instance, who have acquired the sample from the individual), or directly from archival repositories. 

Informatics: A global term used to describe a collection of modern, usually "high 
throughput" and computer-based scientific techniques that provide, generate, accumulate, and/or 
particularly analyze information about the genotypic and/or phenotypic and/or metabolic state of a 

35 cell or organism. Such techniques include genomic analyses and proteomic analyses, as well as 
metabolomic analyses. Informatics represents a subtle, but significant, shift in perspective among 
biologists. Whereas historically, scientists were accustomed to simplifying their systems to make 
metabolic interpretations, informatics allows scientists to embrace biological complexity and to make 
metabolic or phenotypic inference on the basis of as much information as possible. Genomics has 

40 brought to us the concept of high throughput science, and as a result, it has demonstrated the power of 
non-targeted and unbiased data acquisition. Although non-targeted data acquisition is uncommon in 
metabolite analysis, it does not violate the hypothesis-oriented procedure for scientific study. Rather, 
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high-throughput and non-targeted data acquisition simply allows scientists to test their specific 
hypotheses on a larger, non-biased dataset This investigative process functions differently than in a 
traditional reductionist approach, where experiments are designed to address single questions. 
Informatics focuses on obtaining accurate data that can be integrated with other datasets so that future 
5 hypotheses can be tested on a database in silico rather than at the laboratory bench. This method of 
investigation is suited to genomics, where sequences from disparate sources are integrated easily into 
one database because the genetic code is essentially universal. Because metabolomic data is 
influenced by the environment, and can be different depending on the time and conditions under 
which the sample is taken, a metabolomic database involves providing for considerably more 

10 complexity than is seen in a genomic database. 

Lipid: As used herein, the term lipid refers to a class of water-insoluble, oily or greasy 
organic substances, that are extractable from cells and tissues by nonpolar solvents, such as 
chloroform or ether. The most abundant kinds of lipids are the fats or triacylglycerols, which are 
major fuels for most organisms. Another class of lipids is the polar lipids, which are major 

15 components of cell membranes. The following table (Table 1) provides one way of grouping major 
types of lipids; these have been grouped according to their chemical structure: 
Table 1: 



Lipid type 


Representative examples or sub-groups 


Triacylglycerols 




Waxes 




Phosphoglycerides 


phosphatidylethanolamine 




phosphatidylcholine 




phosphatidylserine 




phosphatidylinositol 




cardiolipin 


Sphingolipids 


sphingomyelin 




cerebrosides 




gangliosides 


Sterols and their fatty 


(see Table 3) 


acid esters , 



Lipid metabolites may also be broken down into other recognized classes, such as those shown in 
20 Table 2: 
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Table2: 



SCIENTIFIC NAME 


ABBREVIATION 


Lysorhosphatidylcholine 


LY 


Sphingomyelin 


SP 


Phosphatidylcholine 


PC 


PVi/*\cnKoti/1\r1 carina 

riiospiiaiiuyi serine 


ro 


Phosphatidylinositol 


pi 


Phosphatidylethanolamine 


PE 


Cardiolipin 


CL 


Free Fatty Acids 


FFA 


Monoacylglycerides 


MAG 


Diacylglycerides 


DAG 


Triacylglycerides 


TAG 


Cholesterol Esters 


CE 


Phosphatide acids 


PA 


Phosphatidylglycerols 


PG 


CDP-diacylglycerols 


CDP-DAG 


Lysocardiolipin 


LyCL 


Lysophosphatidylethanolamine 


LyPE 



Specific subclasses (or groups of classes) of lipids can be distinguished based on the position of the 
fatty acids on the lipid back bone. For instance, the following are positionally specific isomers of 
5 lyso-lipid classes: 1-acyl, 2-lyso-x (where x is PC, PS, PE, PI, PG, or PA); l-Iyso,2-acyl-x (here x is 
PC, PS, PE, PI, PG, or PA); 1-acyl, 2,3-lyso-monoacylglyceride; 1-lyso, 2-acyl, 3-lyso- 
monoacylglyceride; 1, 2-acyl diacylglydceride; and 1,3-acyl diacylglyceride. 

Also included in the term lipid are the compounds collectively known as sterols. Table 3 
shows representative sterols. 
10 Table 3: 



SCIENTIFIC NAME 


MOLECULAR 


COMMON NAME 




FORMULA 




5b-cholestan-3b-ol 


C27H4gO 


coprostanol 


5a-cholestan-3b-ol 


C27H48O 


dihydrocholesterol 


5-cholesten-3b-ol 


C27H 46 0 


cholesterol 


5,24-cholestadien-3b-ol 


C27H440 


desmosterol 


5-cholestan-25a-methyl-3b-ol 


C28H420 


campesterol 


5-cholestan-24b-methyl-3b-ol 


C28H420 


dihydrobrassicasterol 


5-cholesten-24b-ethyl-3b-ol 


C29H50O 


b-sitosterol 


5,22-cholestadien-24b-ethyl-3b-ol 


C29H480 


stigmasterol 



Metabolite: A biomolecule that has a functional and/or compositional role (such as a 
component of a membrane) in a biological system, and which is not a molecule of DNA, RNA, or 
protein. Examples of metabolites include lipids, carbohydrates, vitamins, co-factors, pigments, and 
15 so forth. Metabolites can be obtained through the diet (consumed from the environment) or 

synthesized within an organism. Genes and proteins exist in large part to break down, modify, and 
synthesize metabolites. Metabolites are not only directly responsible for health and disease, but their 
presence in a biological system is the result of a variety of factors including genes, the environment, 
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and direct nutrition. By profiling the metabolite composition of a biological sample, for instance 
using the methods described herein, data on genotype, metabolism, and diet can be obtained in great 
detail. This data can be linked to clinical information and used to identify the true biochemical basis 
for health and disease. 

5 Lipids are perhaps the most important subset of metabolites, because dietary lipids and lipid 

metabolism are clearly linked to the incidence and progression of several major degenerative 
diseases, including heart disease, diabetes, obesity, auto-immunity, and chronic inflammation. 
Moreover, because lipids are the only major nutrients that survive digestion intact, highly accurate 
information on individual nutrition can be gained from a lipid metabolite profile. Thus, a lipid 
10 metabolomic approach provides information encompassing the entire spectrum of factors that 
influence disease. 

Each fatty acid may be found as a component of any lipid class, and in such combination is a 
different metabolite than it is on its own (free) or as a component in any other lipid class. Thus, 
' palmitoleic acid in cholesterol esters is a distinct metabolite from palmitoleic acid in 
15 triacylglycerides, and so on. By way of example, if a system is used in which lipids are categorized 
into 17 classes (as shown in Table 2), and there an analysis determines the concentration of 38 fatty 
acids and sterols are determined in each class, then 17 x 38, or 646 specific metabolite concentrations 
may be determined. 

Metabolomics: Highly parallel acquisition, databasing, and analysis of metabolite levels in 
20 a biological sample. In some instances, the sample is obtained from a subject or individual currently 
experiencing or being maintained under one or more defined condition(s). There are several levels of 
metabolomics - these can be differentiated for instance based on the scope of the individual 
metabolite profile, where scope refers to the number or type of metabolites measured in the individual 
analysis. Thus, lipid metabolomics is the study or analysis of a set of individual lipid metabolites. 
25 Carbohydrate metabolomics is the study or analysis of a set of individual carbohydrate metabolites. 
The set of data produced from analysis of an individual sample is referred to herein as a individual 
lipid metabolite/metabolic profile ("lipomic profile") of that sample. Certain examples of lipid 
metabolite profiles include a highly comprehensive set of metabolite measurements (a profile) by 
multi-parallel analyses. 

30 The comparison of two metabolite profiles of similar scope (/.<?., containing information 

about the same or a similar or overlapping set or subset of metabolites) from cells/tissues/subjects 
that have been differently treated, or that are genetically different or different based on disease state 
or condition, provides information on the metabolic effects of the difference. 

A metabolome is a data set that includes levels of metabolites in a biological system (e.g., a 

35 cell, tissue, biological fluid, or whole subject) under specific conditions; a multidimensional 

metabolome includes such data from like samples over a variety of conditions {e.g., time points, 
treatment points, different drug or other treatments, and so forth). 
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Quantitative metabolomic data as discussed herein include molar quantitative data, mass 
quantitative data, and relational data by either moles or mass (mole % or weight %, respectively) for 
individual metabolites, or subsets of metabolites. Quantitative aspects of metabolomic samples may 
be provided and/or improved by including one or more quantitative internal standards during the 
5 analysis, for instance one standard for each lipid class (in a lipomic profile). Internal standards 
employed in the methods described herein enable true quantification of each fatty acid from each 
lipid class, whereas traditional lipid analysis methods produce data in either a percent-of-total format 
or as a mixed population of lipid metabolites. Provided internal standards are designed to reflect any 
loss of fatty acid due to oxidation, discrimination, or cross-contamination. 
10 Using methods described herein, quantitative data can be integrated from multiple sources 

(for instance, samples generated from different labs, samples from different subjects, or merely 
samples processed on different days) into a single seamless database, regardless of the number of 
metabolites measured in each discrete, individual analysis. 

Metabolite fingerprint (or linked profile): A distinct or identifiable pattern of metabolite 
15 levels, for instance a pattern of high and low metabolites of a defined set, such as a biogenerative 
pathway. In specific embodiments, the metabolite levels in the fingerprint are absolute metabolite 
concentrations. Metabolite fingerprints (also referred to as linked profiles, e.g., a disease-linked 
profile or toxin-linked profile) can be linked to a tissue or cell type, to a particular stage of normal 
tissue growth or disease progression, to a dietary limitation or supplementation, or to any other 
20 distinct or identifiable condition that influences metabolite levels (e.g., concentrations) in a 

predictable or associatable way. Metabolite fingerprints can include relative as well as absolute 
levels of specific metabolites, but absolute levels (e.g., concentrations) are preferred in many 
embodiments. Specific examples of metabolite fingerprints are lipid metabolite fingerprints. 

Pharmaceutical/therapeutic agent: Any agent, such as a protein, peptide (e.g., hormone 
25 peptide), other organic molecule or inorganic molecule or compound, or combination thereof, that has 
one or more effects on a biological system, such as a desired therapeutic or prophylactic effect when 
properly administered to a subject. 

Quantified metabolic profile: A set of quantified measurements of one or more 
metabolites. The profile usually contains more than one quantified measurements for a metabolite 
30 and provides a metabolic snap shot of a condition. Specific examples of quantified metabolic profiles 
are specific for a condition to which an organism is subject, such as a genotype, for instance a 
knockout of a specific gene; a dietary limitation or supplementation; a disease or disease state; a 
treatment with a compound, for instance a drug, toxin, suspected toxin, pharmaceutical agent, or 
compound that is a candidate for a pharmaceutical agent, and so forth. 
35 Quantified measurement of a metabolite: A measurement of the concentration of a 

metabolite, obtained by using an internal standard for the metabolite. The measurement is usually 
readily comparable with any other measurements of the metabolite, e.g., from a different sample from 
a same or different organism, which different organism is subject to the same or a different condition, 
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or samples generated using a different method or approach for obtaining the measurements. The 
quantified measurements can be integrated from multiple sources (whether it is work from different 
labs, samples from different subjects, or merely samples processed on different days) into a single 
database, regardless of the number of metabolites measured in each discrete, individual analysis. For 
5 example, quantified measurements of a lipid generally include measurements of the concentration of 
the lipid within each lipid class using one or more internal standards for each lipid class. The 
measurements can be compared with any other measurements of the lipid regardless how the 
measurements were obtained and can be integrated into one database readily searchable for useful 
indications or patterns. 

10 Subject: Living multi-cellular vertebrate organisms, a category that includes both human 

and non-human mammals. 

Unless otherwise explained, all technical and scientific terms used herein have the same 
meaning as commonly understood by one of ordinary skill in the art to which this invention belongs. 
The singular terms "a," "an," and "the" include plural referents unless context clearly indicates 

15 otherwise. Similarly, the word "or" is intended to include "and" unless the context clearly indicates 
otherwise. Hence "comprising A or B" means include A, or B, or A and B. It is further to be 
understood that all base sizes or amino acid sizes, and all molecular weight or molecular mass values, 
given for metabolites, nucleic acids or polypeptides are approximate, and are provided for description. 
Although methods and materials similar or equivalent to those described herein can be used in the 

20 practice or testing of the present invention, suitable methods and materials are described below. All 
publications, patent applications, patents, and other references mentioned herein are incorporated by 
reference in their entirety. In case of conflict, the present specification, including explanations of 
terms, will control. In addition, the materials, methods, and examples are illustrative only and not 
intended to be limiting. 

25 

///. Overview of Several Embodiments 

One embodiment is a method of generating a quantitative metabolomic database, which 
includes generating a plurality of quantitative metabolite profiles from a plurality of biological 
samples and assembling the plurality of metabolite profiles into a database. Biological samples for 

30 such methods may be selected from individual subjects and/or archival repositories, and may be 

acquired directly from individuals, from clinicians, or from archival repositories directly. In specific 
examples, the biological samples are taken from animals, for instance humans, mice, rats, monkeys, 
chickens, cats, dogs, pigs, horses, cows, fruit flies, or worms. 

Also disclosed are methods of providing a metabolomic profile database. In certain 

35 embodiments, the metabolomic profile database is a lipomic profile database. One such method 
involves collecting a biological sample, performing quantitative lipid metabolite analysis on it to 
generate a lipomic profile for the sample, entering the lipomic profile into one or more tables (for 
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instance, a table on a computer), and repeating these steps a plurality of times. The plurality of data 
entries in the table(s) is a lipomic database. 

Also provided are methods of permitting (for instance, for a fee) access to the metabolomic 
profile databases described herein. Examples of such methods involve embodiments in which access 
5 is through a computer interface, for instance from a remote computer across the Internet to the 
computer that contains the database itself. 

Further embodiments are methods of generating quantitative lipomic data. Certain of such 
methods include separating a biological sample into fractions based on a plurality of lipid classes, and 
measuring the quantity of a plurality of lipid metabolites in the fractions. Either separating or 

10 measuring in these methods may involve a chromatographic method, such as thin-layer, gas and/or 
liquid chromatography. The plurality of lipid classes may include, for instance, phospholipids, 
glycerides, and other lipids. An alternative division of lipids into class may be as follows: lyso- 
phosphatidylcholines, sphingomyelins, phosphatidylcholines, phosphatidylserines, 
phosphatidylinositols, phosphatidylethanolamines, cardiolipins, free fatty acids, monoacylglycerides, 

15 diacylglycerides, triacylglycerides, and cholesterol esters. In examples of these methods, at least one 
quantitative internal standard is included for each lipid class. 

In the methods described herein, lipid metabolites may include tetradecanoic acid, 
pentadecanoic acid, hexadecanoic acid, heptadecanoic acid, octadecanoic acid, eicosanoic acid, 
docosanoic acid, tetracosanoic acid, 9-tetradecenoic acid, 9-hexadecenoic acid, 11-octadecenoic acid, 

20 9-octadecenoic acid, 1 1-eicosenoic acid, 5,8,1 1-eicosatrienoic acid, 13-docosenoic acid, 15- 
tetracosenoic acid, 9,12,15-octadecatrienoic acid, 6,9,12,15-octadecatetraenoic acid, 11,14,17- 
eicosatrienoic acid, 8,11,14,17-eicosictetraenoic acid, 5,8,11, 14, 17-eicosapentaenoic acid, 
7,10,13,16,19-docosapentaenoic acid, 4,7,10,13,16,19-docosahexaenoic acid, 6,9,12,15,18,21- 
tetracoshexaenoic acid, 9,12-octadecadienoic acid, 6,9,12-octadecatrienoic acid, 11,14-eicosadienoic 

25 acid, 8,11,14-eicosatrienoic acid, 5,8,1 1,14-eicosicatetraenoic acid, 13,16-docsadienoic acid, 

7,10,13,16-docosicatetraenoic acid, 4,7,10,13,16-docosapentaenoic acid, 9-trans-hexadecenoic acid, 
9-trans-octadecenoic acid, 8-eicosaenoic acid, 5-eicosaenoic acid, plasmalogen fatty acids, 5b- 
cholestan-3b-ol, 5a-cholestan-3b-ol, 5-cholesten-3b-oI, 5,24-cholestadien-3b-ol, 5-cholestan-25a- 
methyl-3b-ol, 5-cholestan-24b-methyl-3b-ol, 5-cholesten-24b-ethyl-3b-ol, or 5,22-cholestadien-24b- 

30 ethyl-3b-ol, for instance. Individual fatty acids may be found as a component of any lipid class, and 
in such combination is a different metabolite than it is on its own (free) or as a component in any 
other lipid class. Thus, palmitoleic acid in cholesterol esters is a distinct metabolite from palmitoleic 
acid in triacylglycerides, and so on. 

Further provided embodiments are methods for presenting analysis of a plurality of 

35 individual lipid metabolite profiles, which methods involve designating the plurality of individual 
metabolite profiles (for instance, from within a cumulative database of such profiles), identifying at 
least one difference or at least one similarity in a metabolite in the plurality of individual metabolite 
profiles, and displaying at least one difference or at least one similarity in a metabolite in the plurality 
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of individual metabolite profiles. In specific examples of such embodiments, the displaying 
generates a web page for viewing. Such viewable web page may include, for instance, a 
representation of metabolite differences or similarities in the form of a heat map or targeting chart, or 
both. 

5 Lipomic databases as described herein can be used in disease diagnosis, prognosis, or 

prediction, for instance by screening the lipomic database for a lipid metabolite fingerprint that is 
linked to the disease. These methods are also encompassed herein. 

Further provided methods include methods of determining a metabolic effect of a condition 
(such as a genotype, for instance a knockout of a specific gene; a dietary limitation; a disease or 

10 disease state; a treatment with a compound, for instance a drug, toxin, suspected toxin, 

pharmaceutical agent, or compound that is a candidate for a pharmaceutical agent) on a subject. 
Examples of such methods involve subjecting the subject to the condition, taking at least one 
biological sample from the subject (usually after they are subjected to the condition), analyzing the 
biological sample to produce a test lipomic profile for the subject, comparing the test lipomic profile 

15 for the subject with a control lipomic profile, and drawing conclusions about the metabolic effect of 
the condition based on differences or similarities between the test lipomic profile and the control 
lipomic profile. The control lipomic profile may be for instance a compiled lipomic profile 
assembled from a plurality of individual lipomic profiles, or a pre-condition (e.g., pre-treatment) 
lipomic profile from the subject. 

20 Specific examples of such methods are methods of determining the effectiveness of drug or 

treatment in a subject, for instance treatment with a hormone or a drug or other treatment that relates 
to controlling obesity or diabetes. Generally, in these methods a drug or treatment is applied to the 
subject, a biological sample is taken from the subject, and the biological sample is analyzed to 
produce a test lipomic profile for the subject. This test lipomic profile for the subject is compared 

25 with a control lipomic profile (such as the control lipomic profiles discussed above), and conclusions 
are drawn about the effectiveness of the drug or treatment based on differences or similarities 
between the test lipomic profile and the control lipomic profile. 

Also provided are methods of determining likelihood of success of a treatment or procedure, 
such as an organ transplant. In such methods, the subject is subjected to the treatment or procedure, 

30 and a biological sample is taken from the subject. The biological sample is analyzed to produce a test 
lipomic profile for the subject, which is then compared with a control lipomic profile. Conclusions 
about the likelihood of success of a treatment or procedure are then drawn based on differences or 
similarities between the test lipomic profile and the control lipomic profile. 

A further embodiment is a method of screening for a compound useful in treating, reducing, 

35 or preventing a disease or progression of a disease, comprising determining if application of a test 
compound alters a disease-related lipid metabolite profile so that the profile less closely resembles a 
disease-linked profile than it did prior to such treatment, and/or more closely resembles a non-disease 
profile (one from a subject, individual, or sample taken therefrom, where the subject or individual 
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does not have the disease or condition). A compound that so alters the disease-related lipid 
metabolite profile is then selected, for instance for further testing or other study. Examples of such 
disease-related lipid metabolite profile include a level of at least one of the following metabolites (as 
a free fatty acid, or as a component of any lipid class): tetradecanoic acid, pentadecanoic acid, 
5 hexadecanoic acid, heptadecanoic acid, octadecanoic acid, eicosanoic acid, docosanoic acid, 

tetracosanoic acid, 9-tetradecenoic acid, 9-hexadecenoic acid, 1 1-octadecenoic acid, 9-octadecenoic 
acid, 11-eicosenoic acid, 5,8,1 1-eicosatrienoic acid, 13-docosenoic acid, 15-tetracosenoic acid, 
9,12,15-octadecatrienoic acid, 6,9,12,15-octadecatetraenoic acid, 11,14,17-eicosatrienoic acid, 
8,11,14,17-eicosictetraenoic acid, 5,8,1 1,14,17-eicosapentaenoic acid, 7,10,13,16,19- 

10 docosapentaenoic acid, 4,7,10,13,16,19-docosahexaenoic acid, 6,9,12,15,18,21-tetracoshexaenoic 
acid, 9,12-octadecadienoic acid, 6,9,12-octadecatrienoic acid, 11,14-eicosadienoic acid, 8,11,14- 
eicosatrienoic acid, 5,8,1 1,1 4-eicosicatetraenoic acid, 13,16-docsadienoic acid, 7,10,13,16- 
docosicatetraenoic acid, 4,7,10,13, 16-docosapentaenoic acid, 9-trans-hexadecenoic acid, 9-trans- 
octadecenoic acid, 8-eicosaenoic acid, 5-eicosaenoic acid, plasmalogen fatty acids, 5b-cholestan-3b- 

15 ol, 5a-cholestan-3b-ol, 5-cholesten-3b-ol, 5,24-cholestadien-3b-ol, 5-cholestan-25a-methyl-3b-ol, 5- 
cholestan-24b-methyl-3b-ol, 5-cholesten~24b-ethyl-3b-ol, or 5,22-cholestadien-24b-ethyl-3b-ol. 

Also provided are computer-readable media having contained thereon a metabolomic 
database, wherein the database contains a plurality of records, each record including quantitative data 
for a plurality of metabolites from a biological sample. In specific examples, the metabolomic 

20 database is a lipomic database, and each record of the lipomic database includes quantitative data for 
a plurality of lipid metabolites from a biological sample, such as a sample taken from an individual, 
organism or subject undergoing or suffering from or subject to a condition. Biological samples may 
include samples from any or all of representative microbes, plants, or animals (e.g., humans, mice, 
rats, monkeys, chickens, cats, dogs, pigs, horses, cows, fruit flies, or worms.). 

25 Another embodiment is a database generated using methods described herein, where the 

database containing a profile table including a quantified metabolic profile from a biological sample 
from an individual having a condition, wherein the quantified metabolic profile includes a quantified 
measurement of a metabolite (or more than one metabolite) and wherein the quantified measurement 
is obtained using an internal standard (such as those described herein) for the metabolite so that the 

30 quantified measurement is integratable into a database. Metabolites measured and quantified in the 
metabolic profiles may be, for instance, lipids, carbohydrates, vitamins, co-factors, and pigments. 

It is contemplated that, in some embodiments, biological samples in this context will include 
a biological fluid or tissue sample. Biological samples in some embodiments are selected from 
individual subjects or archival repositories, or some of both, or from animal models. In some 

35 examples, at least some of the biological samples used to generate the database are samples taken 
from an animal, for instance, a human, mouse, rat, monkey, chicken, cat, dog, pig, horse, cow, fruit 
fly, or worm. Specific databases contain profiles generated from biological samples from different 
species, different analysis methods, etc. 
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In addition, it is specifically contemplated that some samples are obtained from an organism 
that is subject to a condition. For instance, the condition can include a trait (such as a genotype, for 
instance a genetic knockout or other mutation) of the organism from which the biological sample is 
obtained; a dietary limitation or supplementation; a disease or disease state; application of a toxin or 
5 suspected toxin; application of a pharmaceutical or therapeutic agent or candidate agent to the 

organism; an increase in exercise, a decrease in exercise, or a change in an exercise regimen of the 
subject; or some combination of these circumstances. 

In particular embodiments, the databases contains lipid metabolite data, wherein at least one 
quantified lipid metabolite is selected from the group consisting of tetradecanoic acid, pentadecanoic 
10 acid, hexadecanoic acid, heptadecanoic acid, octadecanoic acid, eicosanoic acid, docosanoic acid, 
tetracosanoic acid, 9-tetradecenoic acid, 9-hexadecenoic acid, 11-octadecenoic acid, 9-octadecenoic 
acid, 11-eicosenoic acid, 5,8,1 1-eicosatrienoic acid, 13-docosenoic acid, 15-tetracosenoic acid, 
9,12,15-octadecatrienoic acid, 6,9,12,15-octadecatetraenoic acid, 11,14,17-eicosatrienoic acid, 
8,11,14,17-eicosictetraenoic acid, 5,8,11,14,17-eicosapentaenoic acid, 7,10,13,16,19- 

15 docosapentaenoic acid, 4,7,10,13,16,19-docosahexaenoic acid, 6,9,12,15,18,21-tetracoshexaenoic 
acid, 9,12-octadecadienoic acid, 6,9,12-octadecatrienoic acid, 11,14-eicosadienoic acid, 8,11,14- 
eicosatrienoic acid, 5,8,1 1,14-eicosicatetraenoic acid, 13,16-docsadienoic acid, 7,10,13,16- 
docosicatetraenoic acid, 4,7,10,13,16-docosapentaenoic acid, 9-trans-hexadecenoic acid, 9-trans- 
octadecenoic acid, 8-eicosaenoic acid, 5-eicosaenoic acid, plasmalogen fatty acids, 5b-cholestan~3b- 

20 ol, 5a-cholestan-3b-ol, 5-cholesten-3b-ol, 5,24-cholestadien-3b-ol, 5-cholestan-25a-methyl-3b-ol, 5- 
cholestan-24b-methyl-3b-ol, 5-cholesten-24b-ethyl-3b-ol, and 5,22-cholestadien-24b-ethyl-3b-ol, 
each as a compound or a component of a lipid molecule. 

Also encompassed herein is a database wherein the quantified metabolic profile includes a 
quantified measurement of a lipid in a lipid class. For instance, the quantified measurement of a lipid 

25 in a lipid class is in some instances obtained using an internal standard for the lipid class. 

In some instances, a quantified lipid is selected from the group consisting of fatty acid 16:0, 
18:0, 16:ln7; 18:ln7; 18:ln9; 18:3n3; 20:5n3; 22:5n3; 22:6n3; 18:2n6; 18:3n6; 20:3n6; and 20:4n6, 
each as a compound or a component of a lipid molecule. Other examples of lipids include a sterol 
selected from the group consisting of 5b-cholestan-3b-ol, 5a-cholestan-3b-ol, 5-cholesten-3b-ol, 5,24- 

30 cholestadien-3b-ol, 5-cholestan-25a-methyl-3b-ol, 5-cholestan-24b-methyl-3b-ol, 5-cholesten-24b- 
ethyl-3b-ol, and 5,22-cholestadien-24b-ethyl-3b-ol, each as a compound or a component of a lipid 
molecule. 

Lipid classes include lyso-phosphatidylcholine, sphingomyelin, phosphatidylcholine, 
phosphatidylserine, phosphatidylinositol, phosphatidylethanolamine, cardiolipin, free fatty acids, 
35 monoacylglycerides, diacylglycerides, triacylglycerides, and cholesterol esters, for instance. 

Representative examples of such internal standards are provided herein, as is teaching to 
make internal lipid standards more generally. A particular embodiment is a database as described 
above, wherein at least one internal standard is selected from the group consisting of diheptadecanoyl 
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phosphatidylcholine, dipentadecaenoyl phosphatidylethanolamine, tetraheptadecenoyl cardiolipin, 
diheptadecenoyl phosphatidylserine, pentadecenoyl sphingomyelin, hefctadecanoyl lyso- 
phosphatidylcholine, tripheptadecaenoyl glyceride, pentadecaenoic acid, heptadecanoic cholesterol 
ester and free fucosterol. In other specific embodiments, the internal standard is heptadecanoic 1- 
5 heptadecanoyl-2-lyso-phosphatidycholine for the lipid class of lysophospholipids, N-pentadecenoyl- 
D-erythro-sphingosylphorylcholine for the lipid class of sphingomyelin, 1,2 
diheptadecanoylphosphatidylcholine for the lipid class of phosphatidylcholine, 1,2- 
diheptadecenoylphosphatidylethanolamine for the lipid class of phosphatidylethanolamine, 1,2- 
diheptadecenoylphosphatidyiserine for the lipid class of phosphatidylserine, pentadecaenoic acid for 

10 the lipid class of free fatty acids, triheptadecaenoic acid for the lipid class of triacylglycerides, 

l,r,2,2'-tetraheptadecaenoyl cardiolipin for the lipid class of cardiolipin, cholesteryl heptadecanoate 
for the lipid class of cholesterol esters and stigmasterol for the lipid class of free sterols. 

Also provided is a computer readable medium containing a database as described herein. 
One example of such a computer readable medium is one where the metabolomic database is a 

15 lipomic database, and wherein at least one record comprises quantitative data for a plurality of lipid 
metabolites from a biological sample. Examples of such databases include those in which the 
database comprises records that comprise data from animal {e.g., humans, mice, rats, monkeys, 
chickens, cats, dogs, pigs, horses, cows, fruit flies, or worms), plant, or microbial samples. 

Also described herein are databases that further include a sample item table including a 

20 sample record for the quantified metabolic profile, and a condition item table including a condition 
record for the quantified metabolic profile. Specific examples of such databases further comprise a 
genomic item table including a genomic profile for the quantified metabolic profile. Other specific 
examples further comprise an expression item table including a gene expression profile for the 
quantified metabolic profile, and/or a protein item table including a proteomic profile for the 

25 quantified metabolic profile, and/or a character item table including a character profile for the 

quantified metabolic profile, and/or a filter item table including a filter of quantified metabolic profile 
for a desired condition. 

Another embodiment is a user interface for operatively working with a processor to affect 
operation of a database as provided herein, where the user interface includes means for providing 

30 settings for selecting a set of samples, means for providing settings for selecting a set of conditions, 
means for providing settings for selecting a set of metabolites, and means for displaying quantified 
metabolic profiles corresponding to the selected samples and conditions, wherein each displayed 
quantified metabolic profile consists of the quantified measurements of the selected metabolites. 
Optionally, the user interface can further include a display area which displays the value of a 

35 quantified measurement of a metabolite within the quantified metabolic profiles of the selected 
samples and conditions. Optionally, the user interface can further include means for comparing 
quantified metabolic profiles corresponding to a first set of selected samples and conditions to the 
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quantified metabolic profiles corresponding to a second set of selected samples and conditions, and 

means for displaying the comparison. 

Specific examples of the encompassed user interfaces include, for a plurality of metabolites, 

a presentation of an observed quantity of at least one metabolite for a first biological sample with 
5 respect to an observed quantity of the at least one metabolite for a second biological sample, wherein 

the presentation is operable to accept a user indication that further information is desired with respect 

to a selected metabolite. 

Another embodiment is a computer implemented method for operating a relational database 

which method involves creating a profile table including a quantified metabolic profile from a 
10 biological sample from an individual having a condition, wherein the quantified metabolic profile 

comprises a quantified measurement of a metabolite and wherein the quantified measurement is 

obtained using an internal standard for the metabolite so that the quantified measurement is 

integratable into a database, creating a sample item table including a sample record for the quantified 

metabolic profile, creating a condition item table including a condition record for the quantified 
15 metabolic profile, and storing data in the profile table, the sample item table, and the condition item 

table, wherein each quantified metabolic profile corresponds to a sample record and a condition 

record. 

Yet a further embodiment is a computer system for analyzing quantitative lipid metabolomic 
information, which system includes a processor; and a storage medium storing a relational database 

20 accessible by the processor, wherein the storage medium has stored thereon: the relational database 
comprising: a first table including a plurality of records, wherein at least one of the records includes 
quantitative data for a plurality of lipid metabolites. Specific examples of such computer systems 
include a processor, and a storage medium storing a relational database accessible by the processor, 
wherein the storage medium having stored thereon a relational database comprising a profile table 

25 including a quantified metabolic profile from a biological sample of a condition, wherein the 

quantified metabolic profile comprises a quantified measurement of a metabolite and wherein the 
quantified measurement is obtained using an internal standard for the metabolite so that the quantified 
measurement is integratable into the relational database, a sample item table including a sample 
record for the quantified metabolic profile, and a condition item table including a condition record for 

30 the quantified metabolic profile. 

Specific internal standards and internal standard compositions, which often contain a 
mixture of two or more internal standards, are also provided. By way of example, another 
embodiment is an internal standard composition for lipid analysis of a sample, comprising a plurality 
of lipid species, wherein at least one lipid species comprises at least one monounsaturated fatty acid 

35 of formula N:lnR, wherein N is an odd integer equal to or larger than three, wherein R is any integer 
equal to or less than N-l, and wherein at least one of the plurality of lipid species is a free fatty acid, a 
sphingomyelin, a cardiolipin, a phosphatidylethanolamine, a phosphatide acid, a 
phosphytidylcholine, a phosphatidylserine, a phosphatidylinositol, a phosphatidylglycerol, a 
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monoacylglyceride, a diacylglyceride, a triacylglyceride, a sterol ester, or a lysophospholipid. In 
specific examples of these compositions, each lipid species comprises at least one such 
monounsaturated fatty acid. 

In particular example internal standard compositions, at least one of the monounsaturated 
5 fatty acids in the standard is not present in the sample. In examples of such compositions, each of the 
monounsaturated fatty acids is not present in the sample. 

Particular examples of these internal standard compositions will include at least one lipid 
species having at least one monounsaturated fatty acid, wherein N is 3, 5, 7, 9, 11, 13, 15, 17, 19, 21, 
23, or 25. 

10 Optionally, each of the plurality of lipid species in the internal standard composition 

represents a specific (for instance, a different) lipid class. In some instances, each of the plurality of 
lipid species in an internal standard is present in the composition at a concentration equivalent to 
(e.g., with an order of magnitude) the concentration of a sample lipid species (for instance, the most 
abundant, second most abundant, third most abundant, and so forth) from the same lipid class as 

15 represented by that lipid species. By way of example, the internal standard compositions may include 
at least three lipid species, at least three lipid species, at least four lipid species, at least five lipid 
species, at least eight lipid species, or at least three ten species or more. 

In particular example compositions, at least one of the lipid species is a lysophospholipid, 
and the lysophospholipid has the formula l-acyl,2-lyso-M or l-lyso,2-acyI-M, and where M is 

20 phosphytidylcholine, phosphatidylserine, phosphatidylethanolamine, phosphatidylinositol, 
phosphatidylglycerol, or phosphatidic acid. 

In still other particular example compositions, the lipid classes comprise lyso- 
phosphatidylcholines, sphingomyelins, phosphatidylcholines, phosphatidylserines, 
phosphatidylinositols, phosphatidylethanolamines, cardiolipins, free fatty acids, monoacylglycerides, 

25 diacylglycerides, triacylglycerides, cholesterol esters, phosphatidic acids, phosphatidylglycerols, 
CDP-diacylglycerols, lysocardiolipins, lysophosphatidylethanolamines, or two or more thereof. 

Also provided is an internal standard for phosphatidylethanolamines, phosphatidic acids, 
phosphytidylcholines, phosphatidylserines, phosphatidylinositols, phosphatidylglycerols, 
diacylglycerides, or triacylglycerides, comprising a first fatty acid of formula N:0 in the sn-1 position 

30 and a second fatty acid of formula M:Y in the sn-2 position, where Y is an integer greater than 0. 
One specific example is an internal standard for phosphatidylethanolamines, wherein the internal 
standard comprises a phosphatidylethanolamine that comprises the first fatty acid and the second 
fatty acid. Another specific example is an internal standard for phosphatidic acids, wherein the 
internal standard comprises a phosphatidic acid that comprises the first fatty acid and the second fatty 

35 acid. Still another example is an internal standard for phosphytidylcholines, wherein the internal 

standard comprises a phosphytidylcholine that comprises the first fatty acid and the second fatty acid. 
Yet a further example is an internal standard for phosphatidylserines, wherein the internal standard 
comprises a phosphatidylserine that comprises the first fatty acid and the second fatty acid. Another 
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example is an internal standard for phosphatidylinositols, wherein the internal standard comprises a 
phosphatidylinositol that comprises the first fatty acid and the second fatty acid. Yet another example 
is an internal standard for phosphatidylglycerols, wherein the internal standard comprises a 
phosphatidylglycerol that comprises the first fatty acid and the second fatty acid. Another provided 
5 example is an internal standard for diacylglycerides, wherein the internal standard comprises a 

diacylglyceride that comprises the first fatty acid and the second fatty acid. Still another example is 
an internal standard for triacylglycerides, wherein the internal standard comprises a triacylglyceride 
that comprises the first fatty acid and the second fatty acid. Optionally, such an example internal 
standard for triacylglycerides further includes a third fatty acid that is different from the first fatty 

10 acid and the second fatty acid. 

Another embodiment is an internal standard for triacylglycerides or cardiolipins, comprising 
a first fatty acid of formula N:X at a first position, a second fatty acid of formula M:Y at a second 
position, and a third fatty acid of formula 0:Z at a third position, wherein N:X, M:Y, and 0:Z are 
different from each other. In some examples, the first position is sn-1 and X is 0. In anther example, 

15 at least Y or Z is 1, and in specific examples, both Y and Z are 1. For instance, in one particularly 
contemplated example of such an internal standard, N:X is 17:0, M:Y is 19:1, and 0:Z is 19:1 and 
wherein the first position is sn-1, the second position is sn-2, and the third position is sn-3. In 
another, N:X is 17:0, M:Y is 19:1, and 0:Z is 19:2 and wherein the first position is sn-1, the second 
position is sn-2, and the third position is sn-3. In still other examples, the internal standard is an 

20 internal standard for triacylglycerides, wherein the internal standard comprises a triacylglyceride that 
comprises the first fatty acid, the second fatty acid, and the third fatty acid. In yet another example, it 
< is an internal standard for cardiolipins, wherein the internal standard comprises a cardiolipin that 
comprises the first fatty acid, the second fatty acid, and the third fatty acid. For instance, in such an 
internal standard for cardiolipins, the first position is sn-1, the second position is sn-2, and the third 

25 position is either sn-1' or sn-2\ By way of example, in one such internal standard the third position 
is sn-1', and X and Z are 0. 

Also provided is an internal standard composition for lipid analysis of a sample, comprising 
a plurality of lipid species, wherein at least one lipid species comprises at least one polyunsaturated 
fatty acid of formula N:lnR, wherein N is an even integer equal to or larger than six (for instance, 6, 

30 8, 10, 12, 14, 16, 18, 20, 22, 24, or 26), wherein R is any integer equal to or less than N-l, and 

wherein the desaturations occur in positions different from the positions of desaturations in fatty acids 
present in the sample, and wherein at least one of the plurality of lipid species is a free fatty acid, a 
sphingomyelin, a cardiolipin, a phosphatidylethanolamine,'a phosphatide acid, a 
phosphytidylcholine, a phosphatidylserine, a phosphatidylinositol, a phosphatidylglycerol, a 

35 monoacylglyceride, a diacylglyceride, a triacylglyceride, a sterol ester, or a lysophospholipid. In 
specific examples of such internal standard compositions, each lipid species comprises at least one 
such polyunsaturated fatty acid. 
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In specific examples of these internals standard compositions, each of the plurality of lipid 
species represents a different lipid class. For instance, such compositions can contain at least three 
lipid species, at least four lipid species, at least five lipid species, at least eight lipid species, at least 
ten lipid species, or more. 

5 In still other specific examples of the internal standard compositions, each of the plurality of 

lipid species is present in the composition at a concentration equivalent to the concentration of a 
sample lipid species from the same lipid class as represented by that lipid species. 

Also provided are specific internal standard compositions, wherein at least one of the 
polyunsaturated fatty acids is not present in the sample. In further examples, each of the 

10 polyunsaturated fatty acids in the internal standard is not present in the sample. 

By way of specific example, at least one of the lipid species in the internal standard 
compositions is a lysophospholipid, and the lysophospholipid has the formula l-acyl,2-lyso-M or 1- 
lyso,2-acyl-M, and where M is phosphatidylcholine, phosphatidylserine, phosphatidylethanolamine, 
phosphatidylinositol, phosphatidylglycerol, or phosphatide acid. In other specific examples, the lipid 

15 classes included in the internal standard composition include lyso-phosphatidylcholines, 
sphingomyelins, phosphatidylcholines, phosphatidylserines, phosphatidylinositols, 
phosphatidylethanolamines, cardiolipins, free fatty acids, monoacylglycerides, diacylglycerides, 
triacylglycerides, cholesterol esters, phosphatide acids, phosphatidylglycerols, CDP-diacylglycerols, 
lysocardiolipins, lysophosphatidylethanolamines, or two or more thereof. 

20 The internal standards described in this disclosure, including particular single internal 

standard molecules or combinations thereof or compositions containing such, can be used with the 
methods provided herein, particularly with the methods of generating quantitative lipomic data. 

IV. Metabolomics 

25 The vast potential of genomics and bioinformatics to identify genes that cause disease by 

investigating whole-genome databases is accepted. By comparing the analysis of an individual's 
genotype with a genomic database, medicine is expecting to personalize health care by providing 
drugs tailored to individual genotype. This same bioinformatic approach, when applied to the study 
of human metabolites, has the potential to identify and validate targets to improve personalized health 

30 through nutrition, pharmacology, environment, physical activity, and/or gene therapy. Advances in 
high-throughput analytical chemistry and computing technologies make the creation of a vast 
database of metabolites possible for several subsets of metabolites including lipids and organic acids. 

In creating integrative databases of metabolites for bioinformatic investigation, the current 
concept of single biomarker measurements must be expanded in three dimensions in order to: 

35 (1) include a highly comprehensive set of metabolite measurements (a profile) by multi- 

parallel analyses; 

(2) measure individuals as a function of time rather than simply in the fasted state; and 

(3) integrate these metabolic profiles with genomic, expression and proteomic databases. 
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Substantial databases of metabolite concentrations will be predictive resources to quantify 
the relationship between metabolites and health. An overview of one way in which a metabolomic 
database can be used is shown in Figure 1. In this schematic drawing, quantitative analysis is 
5 performed to assess and measure the amounts of metabolites in a biological sample. The output of ' 
the analysis is subjected to optional transformation through one or more calculation processes, 
providing a set of numeric results. For instance, if the analysis is a quantitative gas chromatograph, 
the area under the curve can be measured and the relative area of each peak determined. These 
relative areas can be converted into absolute amounts for each individual metabolite measured by the 
10 inclusion of control compounds in the analysis, as described herein. The raw and/or processed data 
are entered into a database, for instance a cumulative database that contains the results from a 
multitude of different analyses. This database can be queried in order to search for specific datasets 
from within the database, and filters (such as those provided herein) can be used to produce limited 
output in interpretable forms. Such forms may be user interfaces that permit continued interaction 
15 with the database, and/or that permit access to more information than the raw or processed results of 
individual analyses or collections thereof. In certain embodiments, the output from such a 
metabolomic database may be graphical or statistical. 

Quality control triggers may be included within the database, which flag samples that are 
outside of expected or predicted limits, or which otherwise trip a trigger so that the user of the 
20 database (and/or the individual entering the data, or a third party) is made aware of that specific 

sample. In specific embodiments, the tripping of such a trigger will indicate that the corresponding 
sample is in someway suspect, and the analysis for that sample may be repeated. 

The application of an informatic approach to the study of metabolites in individuals 
represents an important advance. Scientists currently view their goal as ultimately reductionist and 
25 strive to identify the single best biomarker that reflects phenotype. However, single biomarkers have 
shown very limited success in predicting chronic disease. This has led the inventors to the realization 
that there is a need for more global and integrated approaches for assessing metabolism. Thus, the 
study of metabolites must be redefined in parallel with genomic and proteomic analyses, as the means 
to allow researchers to measure a large number or even an entire set of metabolites. The entire 
30 metabolome, with all of its individual concentrations and quantitative infra-relationships forms the 
metabolic basis of a phenotype. Therefore, only a metabolomic approach can accurately assess the 
complex role of metabolites in defining individual health. 

In part, the reluctance to study metabolism within the framework of informatics arises from 
the inherent complexity of metabolite profiling. Although expression analysis and proteomics are 
35 responsive to the environment and are thus more complex than genomics, they are constrained, at 

least in theory, by a factor of the number of genes present in an organism. The overall metabolome is 
not confined to the products of genes, and thus, the metabolome represents a potentially massive 
inclusive set of compounds. Further, a metabolite profile for a single individual is neither constant 
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among individual cells, nor is it stable over time. Implementing a metabolomic research strategy 
involves planning for considerable complexity. This disclosure provides methods for generating 
metabolomic profiles for individual samples, for sets or subsets of the available metabolites, and 
methods of assembling such profiles into integrated, comprehensive, minable databases. 
5 The ultimate application of these approaches, of course, is to generate knowledge of 

metabolism that is faithful to the overall phenotypes that accurately reflect health, predispositions to 
disease, or other health outcomes. In nutritional terms, for example, understanding the variation in 
metabolic responses to diet is the goal of the science of nutrition. Before embarking on wholesale 
renovations of agricultural products for nutritional improvement, metabolomics is uniquely qualified 
10 to address the questions that must be answered to succeed. Although the development of this 
technology is likely to be driven by human health concerns, rapid analysis of lipids and other 
metabolite classes can be used in the support of a variety of topics including plant and animal 
breeding, characterization of transgenic crops, and fundamental science. Metabolomics will be a part 
of the future of biotechnology, nutrition, and agriculture. 

15 

V. Lipid Metabolomics 

By way of example, this disclosure focuses on analysis of lipid metabolites, generation of 
lipid metabolite profiles, lipid metabolomic databases, and the information that can be mined from 
such profiles and databases. In particular, methods are provided for developing a metabolomic 

20 database capable of producing predictive and diagnostic profiles of disease. 

Figure 2 provides an overview of a lipid metabolomic analysis system. Chromatographic 
data is entered into a database, which can be mined for desired information and presented in the form 
of a graphical interface (for instance a heat map or targeting chart, as shown in the second panel of 
Figure 2). Such interfaces may be optionally provided in interactive form, for instance on a computer 

25 system, or remotely across the Internet or another computer communication system. Data mined 
from the cumulative lipid metabolite database can be used, for instance, for clinical or diagnostic 
testing {e.g., for a propensity to obesity or another biological condition that impacts or is impacted by 
lipid metabolism), or to identify specific metabolic targets of drugs, as described in more detail 
herein. 

30 Present analytical methods, including those disclosed herein, can produce a spectrum of data 

easily developed into a metabolomic database. For instance, fatty acids, glycerolipids, sterols, and 
numerous bioactive lipid mediators (including products of epoxygenase, lipoxygenase and 
cyclooxygenase pathways) are quantifiable in biological samples. Thus, as demonstrated clearly 
herein, a few parallel analyses are capable of defining an essentially complete lipid profile of a 

35 sample. 

Lipids are an attractive subset of metabolites for metabolomic applications. In addition to 
their ubiquitous cellular functions as structural, energetic, and bioactive signaling molecules, lipids 
are reflective of both diet and metabolism. The major fatty acids in human metabolism and the 
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enzymes that modify them are depicted in Figure 3. Fatty acids are an interesting subject matter for 
metabolomics because they are the only major macronutrients to survive digestion intact, and yet 
humans possess the biochemical machinery to process dietary fatty acids further into new forms of 
fatty acid. As a result, the fatty acid composition of tissues and fluids reflects the influence of both 
5 diet and metabolism. By quantifying the fatty acids present in human plasma, for example, a 
researcher could determine the dietary preferences of that individual. Alternatively, and perhaps 
more interestingly, a researcher could assay endogenous lipid metabolism by comprehensive lipid 
analysis, because every lipid substrate and product is measured simultaneously from a single sample. 
Thus, a unique aspect of lipid metabolomic analysis is that the information yielded by an experiment 
reflects the ultimate expression of genomics, proteomics, and environment as a lipid metabolome. 

Because current technology allows for the comprehensive analysis of lipid composition in a 
sample, metabolic interpretations can be extended to the activities of the enzymes that modify lipids. 
Quantitative analysis of fatty acid concentration provides data on not only the fatty acids but also on 
the relative activities of the desaturases and elongases that modify them. Moreover, a quantitative 
analysis of fatty acids from individual glycerolipid classes yields data on the mass of each 
glycerolipid class, thereby enabling the investigation of pathways involved in glycerolipid 
metabolism. The ability to not only profile diseases, but also to identify the complex metabolic 
dysregulations involved in that disease, using the methods provided herein is a major advance for 
medicine. 

The utility of metabolite profiling is not limited to making assessments about the status of 
individuals. Particularly, one advantage of metabolomics is the potential to use a metabolomic 
database as a tool for in silico investigations. The availability of such databases will be particularly 
helpful for applying bioinformatic approaches to nutrition, pharmacology, and toxicology, because 
once a metabolic profile is developed for a specific nutritional or otherwise affected state, it can be 
compared with the metabolomic database to determine the relationships among diet, drugs, toxins, 
treatments, genotype, and phenotype. The ability to mine large databases in silico will be an 
advantage of metabolomics to nutrition, because testing every conceivable nutrient by single clinical 
trials is not possible. 

Moving from single biomarkers to metabolomic analysis is a necessary step inasmuch as 
many approaches to lowering the unilateral risk of one disease in an individual simply increase the 
risk of another disease in that individual. A pertinent example of this problem is the change in 
nutritional recommendations from high fat to high carbohydrate diets. It is widely understood that 
high fat diets increase serum low-density lipoproteins and thus the risk for cardiovascular disease in 
most individuals; however, high carbohydrate diets increase serum triacylglycerides (Kasim-Karakas 
et al.,Am. J. Clin. NutrL 71:1439-1447, 2000) and the risk for cardiovascular disease in a subset of 
the population, particularly some women (Liu et at* Am. J. Clin. NutrL 71:1455-1461 2000). By 
measuring every metabolite involved in lipid metabolism, subtle differences in the predisposition or 
progression of disease among individuals will be elucidated. The broader and much more exciting 



: ) 

WO 03/005628 



PCT/US02/21426 



-23- 



aspect of this technology is thus the generation of metabolic profiles that are not simply markers for 
disease, but metabolic maps that can be used to identify specific genes or activities influential in the 
progression of disease or the maintenance of overall health. In this way, metabolomics is a subset of 
functional genomics. The value of genomic, expression, proteomic, and metabolomic databases in 
5 predicting phenotype will be enhanced dramatically by their horizontal integration into global 
bioinformatic databases. 

VI. Application of Lipid Metabolomics to Predictive Medicine 

Relative to biomolecules, biochemical science has very few ways to quantify phenotype. 

10 Alternatively, medicine has, at its very core, a system for identifying, categorizing and recording 
phenotypic information about individuals. Because science has become exceedingly adept at 
quantifying large numbers of molecules at an astonishing rate of throughput, science and medicine 
should couple their expertise to develop this metabolite-phenotype relationship. By developing a 
database that (1) allows clinicians to input patient information and (2) allows high-throughput science 

15 to contribute analytical data, powerful new predictive and analytical tools are enabled. 

The data from a comprehensive lipid analysis produce information useful for this purpose. 
The applications of a quantitative lipid database are myriad. In one variation, the data from 
comprehensive lipid analyses are used to generate biomarkers of a selected phenotype. These 
biomarkers are not, as traditionally defined, single measurements, but rather complex lipid metabolite 

20 profiles that include a large number of metabolites and even relations between metabolites. These 
profiles, when compared between experimental groups, generate a series of significant differences 
that can be used to construct reliable database filters. A database filter is essentially a way of 
discriminating a set of subset of data, and selecting this data from the database for instance for 
display or further analysis. Simple filters can comprise as few as one specified discriminating 

25 variable, for instance the gender of the individual providing the sample, or the age, or a treatment 
compound. More complex filters, using more than one discriminating variable at a time, are also 
contemplated. In specific instances, the filter can include a list of the most consistent and unique 
metabolite concentrations or interactions that exist between experimental groups (e.g., a filter can be 
based on a profile, such as a condition-linked profile). These differences and interactions are 

30 determined by standard statistical methods. 

Database filters for specific phenotypes can serve many purposes. First, using discriminant 
analysis or an analogous statistical technique, a database filter can identify entries in a database that 
match a phenotype of interest. This is an essential element to metabolomics and informatics in 
general, because it allows scientist to query a database of individuals that were not specifically tested 

35 for the phenotype of interest. 

A second purpose for creating a list of reliable and unique differences between experimental 
groups (a database filter) is to identify the points in the lipid metabolism pathways most closely 
linked with a phenotype. As an example of this approach, a researcher might perform an experiment 



WO 03/005628 



PCTAJS02/21426 



-24- 



to determine the complete lipid profile of patients with type II diabetes. These data would be 
recorded with all of the phenotypic and clinical information relevant to the patient in a database. At a 
later point in time, another researcher could generate metabolic profiles for individuals consuming 
different foods, such as dietary olive or fish oils, respectively, and enter this information into the 
5 same database. Both researchers would now have the ability to identify groupings of patients that 
match either diabetic or dietary profiles. Once the data are collected, it is a simple matter of asking 
the appropriate question in silico to determine if there are relations between dietary oil consumption 
and diabetes. Additionally, the identified differences act as clues for the metabolic basis of the effect. 
There are innumerable advantages to an in silico approach such as outlined above, including 
10 increased statistical power, the avoidance of cumbersome financial and practical limitations to 

experimentation, and the ability to re-assess data as new information emerges. Subject matching, 
dataset selection, and the grouping of experimental sets can all be done through in silico querying. It 
is expected that unanticipated relationships between diet, metabolism, and phenotype will quickly 
emerge. 

15 

V77. Metabolomics as Functional Genomics 

Another aspect of the provided technology is the generation of metabolic profiles that are 
not simply markers for disease, but are metabolic maps that can be used to identify specific genes or 
biochemical activities that cause or influence a disease state. Metabolomics is in essence functional 
20 genomics from metabolite analysis. By defining the metabolic basis for phenotype using the 
techniques described, extraordinary opportunities to understand and treat diseases are provided. 
Much in the same way that gene chips allow researchers to observe the complex expression response 
to a stimulus, metabolomics enables observation of the complex metabolic interplay responsible for 
defining phenotype. 

25 By extending this approach beyond the observation of individual metabolic dysregulations, 

medicine will begin to profile not single diseases, but health. As health is the proper balance of all 
vital metabolic pathways, comprehensive or metabolomic analysis lends itself to identifying 
metabolite distributions necessary for health. Comprehensive and quantitative analysis of lipids 
provides this degree of diagnostic power to researchers and doctors interested in mining metabolic 

30 profiles, and databases containing a plurality of such profiles, for biological meaning. 

VIIL Samples and Sample Processing 

Any sample that contains or may contain the metabolites of interest can be used for the 
analyses provided herein. For instance, samples suitable for inclusion in a quantitative lipid 
35 metabolite database include plasma, serum, tissues or cells from plants, humans or research animals 
(including mouse, rat, non-human primate, pig, chicken or other). The samples may be those from 
plants, humans or other animals, which may optionally have been subjected to pharmacological, 
genetic, toxicological or nutritional intervention. In other embodiments, the samples are from 
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humans or research animals expressing specific traits, for instance those suffering from a disease or 
condition, or displaying a level of athletic performance. 

It is particularly contemplated herein that biological samples may be in vitro cell cultured 
samples, which have been subjected to differential treatment with drugs or potential drugs, or with 
5 any potentially useful pharmaceutical agent (for instance, which might be contemplated as being 

tested for use as a drug), or with a toxin or other stressor or organic or inorganic substance that might 
be expected to cause some change in the metabolome of the subject cell culture. 

The processing of individual samples will be governed at least in part by what type of 
sample is used. Methods of harvesting biological samples are well known to those of ordinary skill 
10 in the art, and those appropriate for use with the provided methods are conventional. Methods for 
preparing the harvested samples for analysis will be influenced by the analysis being performed in 
order to quantify the metabolite(s) of interest. Those of ordinary skill in the art know systems that 
can be used to isolate (at least relatively) specified classes of molecules. 

Optionally, biological samples for use in the provided methods can be stored prior to 
15 preparation and analysis, for instance by freezing, for instance under cryogenic conditions. 

It is contemplated that sample preparation may be carried out by someone other than the 
party that carries out the analysis of metabolites in that sample. Thus, this disclosure includes 
systems in which a sample is harvested, processed at least to a point at which it can be shipped to a 
remote location, and then the processed (or partially processed) sample is transported to a facility at 
20 which the metabolites are assayed. By way of example, the samples may be transported while frozen. 

Likewise, the treatment of subjects prior to harvesting of biological samples may be carried 
out at the same facility that harvests the sample, but this is not necessary for the methods described 
herein. 

25 IX. Individual Sample Analysis 

Several aspects of lipid analysis have been modified, as provided herein, for use in lipid 
metabolomics. Currently, most fatty acid analyses are performed by gas chromatography, a 
technique that provides exquisite separation and quantification of analytes. However, researchers 
continue to report their results as a percentage of total fatty acids. Data in this format are not 

30 comparable between experiments, nor is it comparable between individual lipid classes within an 
experiment, and therefore are not integratable into a database. For example, a scientist interested in 
the metabolism of oleic acid could not determine the distribution of oleic acid among lipid classes in 
plasma from mole percentage or weight percentage data. For quantified measurements of lipid 
metabolomics as provided herein, data produced in each experiment is expressed as a concentration, 

35 for example, micrograms per milliliter, so that a consistent and comparable database of lipids can be 
assembled from multiple experiments. 

In certain embodiments, the data includes quantitative measurements of the fatty acids that 
are organized (or can be organized) by lipid classes. Because lipid classes in some embodiments are 
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separated prior to fatty acid quantification, the composition of a sample is determined in great detail. 
The results of a single analysis may include the mass or concentration of more than 35 individual 
fatty acids from each lipid class present in the sample. The total mass or concentration of the lipid 
class also may be quantified. 
5 Separation and quantitative data can be produced via chromatography using many methods, 

such as gas and liquid chromatography, including high-performance liquid chromatography, thin 
layer chromatography, capillary and gel electrophoresis, and combinations of two or more of these 
methods. See, for instance, methods described in various text and reference analytic chemistry 
books, such as chapters 22-24 of Quantitative Chemical Analysis by D.C. Harris (W.H. Freeman and 

10 Co., 4 th , 1995; ISBN 0-7167-2508-8). Choice of separation and quantitation methods may be 
influenced by the metabolites being measured. 

By way of example, the following methods can be used for generating quantitative lipid 
metabolite data from biological samples. The chromatographic conditions, internal standard 
compositions and amounts, derivatization reactions, extraction conditions, sample amounts, and so 

15 forth can be varied by those of ordinary skill in the art. The following description provides an 
overview of certain non-limiting methods that can be used for analysis of lipid metabolites in a 
biological sample. 

A. Lipid Analysis 

20 Extraction: The lipids from various samples, such as plasma, serum, tissue, or cells, can be 

extracted using a fluid extractant comprising a non-polar component and a polar component. By way 
of example, lipids are extracted from plasma, serum, tissues, and cells by the method of Folch et al 
{J. Biol Chem. 226, 497-509, 1957). By way of example, about two hundred microliters of plasma or 
serum, or about 50 mg of tissues or cells are added to a homogenizer for a single analysis, though 

25 larger or smaller amounts can be used. 

To each sample, the appropriate masses of internal standard such as those provided herein 
are added, as well chloroformrmethanol (2:1 vol/vol). In general, the internal standards are 
compounds that share a lipid class with the target metabolites (Le. an internal standard for 
triacylglyceride metabolites is itself a triacylglyceride), but have fatty acids as constituents that are 

30 not present in the sample being analyzed. An internal standard for any given lipid class is selected to 
behave sufficiently similarly to the target metabolites such that there is essentially no discrimination 
(selective loss or retention) of the internal standard relative to the target metabolites at any step of the 
analytical process before the analysis. The fatty acid moiety of the internal standard compound will 
also generally be different than the fatty acids present in the lipid class analyzed from the sample, so 

35 that the internal standard fatty acid can be separated completely from the target compound fatty acids 
by the analysis. According to specific provided embodiments, at least one unique internal standard is 
used for each class of lipid separated. 
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By way of example, the solution mixture consisting of sample, fluid extractant, and internal 
standard(s) is homogenized, for instance by twelve strokes with a ground-glass homogenizer. 
Following homogenization, potassium chloride (e.g., 1.8 ml of 0.01 M) is added, and the solution 
vigorously mixed. The organic fraction containing the lipids and the internal standard(s) is separated 
5 from the polar fraction of the mixture by centrifugation. The lipid extract can then be removed from 
the mixture and, as needed, concentrated under a stream of nitrogen in preparation for lipid class 
separation. 

Internal Standards: Internal standards for use in the provided methods may take many 
forms. In certain embodiments, lipid classes that separate adjacent to each one another during lipid 

10 class separation have internal standards that contain fatty acids that are different than the fatty acids 
within the internal standard of the adjacent lipid class. This allows one to check for cross- 
contamination and complete separation of lipid classes by looking for the presence of the internal 
standard of one lipid class in the analysis of the adjacent lipid class. 

In some embodiments, saturated fatty acids are employed as internal standards for the 

15 analysis of sphingomyelin, lyso-phospholipids (provided they are l-acyl-2-Iyso-phosphoiipids) and 
cholesterol esters, while saturates and monounsaturated fatty acids are used as internal standards for 
diacylglycerides, monoacylglycerides and free fatty acids. 

Optionally, the internal standards provided by the present disclosure are added to each 
sample such that the fatty acids derived from the internal standard prior to the analysis are present at 

20 concentrations that approximate the second most concentrated fatty acid in the analyzed lipid class of 
the biological sample. This helps to ensure that the internal standard provides accurate data for 
quantifying the fatty acids and, provided the concentration of the sample is appropriate, that both the 
analytes and the internal standard induce a response from the detector that is within its linear and 
quantitative range. 

25 Various fatty acids are particularly contemplated as internal standards, including fatty acid 

saturates, e.g., 3:0, 5:0, 7:0, 9:0, 11:0, 13:0, 15:0, 17:0, 19:0, 21:0, 23:0, 25:0, and 27:0, and fatty acid 
monounsaturates,e.£.,5:l,7:l,9:l, 11:1, 13:1, 15:1, 17:1, 19:1, 21:1, 23:1, and 25:1. In particular 
embodiments, internal standards will include 17:0, 19:0, 15:1, 17:1 and 19:1. 

In addition, polyunsaturated fatty acids may be used as internal standards, provided that they 

30 are odd-carbon numbered chains 3:2, 5:2, 7:2, 9:2, 1 1:2, 13:2, 15:2, 17:2, 19:2, 21:2, 23:2, 25:2, 
and 27:2, as well as 5:3, 7:3, 9:3, 11:3, 13:3, 15:3, 17:3, 19:3, 21:3, 23:3, 25:3, 27:3, and so forth for 
additional unsaturations). 

The position of unsaturated bond(s) within the fatty acid(s) of a standard can be varied to 
produce a large variety of internal standard compounds. For instance, 15:ln7 and 15:ln9 are distinct 

35 fatty acids that share highly similar physical properties. For instance, if 15:1 fatty acids provide the 
physical properties that best mimic lipid classes that also happened to separate next to each other 
during lipid class separation, one of these two fatty acids (15:ln7 and 15:ln9) could be used in one 
class, while the other fatty acid could be used in the second lipid class. Thus, because metabolite 
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analysis is capable of separating 15:ln7 from 15:ln9, the degree of cross-contamination or separation 
of the two lipid classes could be determined. In general, it is useful if the position of the double 
bond(s) in the fatty acid(s) is unique relative to the composition of the biological sample, thus 
facilitating distinguishing these compounds in the final analysis and/or quantification. 
5 In some embodiments, a mixture of internal standards is used to control different aspects of 

the analysis, e.g., positional specificity or compositional variation. For example, complex lipids that 
contain more than one fatty acid per molecule, such as phospholipids and triacylglycerides, typically 
contain defined types of fatty acids in specific positions on the lipid molecule. For instance, saturated 
fatty acids comprise more than 90% of the fatty acids on the sn-1 position (the first carbon on the 

10 glycerol backbone) of phosphatidylcholine in most biological samples, while unsaturated fatty acids 
comprise more then 95% of the fatty acids present in the sn-2 position of phosphatidylcholine. Thus, 
to improve the physical properties of an internal standard for phosphatidylcholine, it may prove 
useful to construct an internal standard molecule such that it contains a saturated fatty acid in the sn-1 
position and an unsaturated fatty acid in the sn-2 position. This approach can be used to improve the 

15 physical properties of the internal standard to better match those of natural compounds. 

Many types of chromatography can selectively deplete fatty acid molecules based on the 
number of double bonds present in the fatty acid, or on the number of carbons in the fatty acid. By 
constructing internal standards with a variety of fatty acids of varying unsaturation and chain length, 
these internal standards can control for these selectivities. For example, if internal standards are 

20 constructed with different fatty acids, e.g., with the different fatty acids present on the same 

glycerolipid molecule, such as triacylglyceride with a 17:0 on the sn-1 position, a 19:1 on the sn-2 
position and a 19:2 on the sn-3 position, the extent of loss of fatty acids of varying unsaturation or 
chain length during analysis can be calculated and used to correct the final data for improved 
quantification. Thus panels of internal standards for each lipid class can be constructed with 

25 knowledge of the typical biological composition of the lipid class. 

By way of specific example, this disclosure particularly contemplates internal standards 
including diheptadecanoyl phosphatidylcholine, dipentadecaenoyl phosphatidylethanolamine, 
tetraheptadecenoyl cardiolipin, diheptadecenoyl phosphatidylserine, pentadecenoyl sphingomyelin, 
heptadecanoyl Iyso-phosphatidylcholine, tripheptadecaenoyl glyceride, pentadecaenoic acid, 

30 heptadecanoic cholesterol ester and free fucosterol, either individually or a combination thereof. 

Separation of Lipid and Phospholipid Classes: The separation of lipid classes can be 
performed by preparative thin-layer chromatography (TLC), for instance using methods described 
herein. 

To remove any residual metal or other damaging contaminants that might be on the TLC 
35 plates, each plate is washed prior to use. By way of example, the following three-step method can be 
used to wash the plates: impregnate each plate with ethylenediamine tetraacetic acid (EDTA), then 
rinse the plates once with methanol and once with chloroform. Each plate is first impregnated with 1 
mM EDTA, pH 5.5, by ascending development using the method of Ruiz and Ochoa (/. Lipid Res. 
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38, 1482-1489, 1997). After each plate is completely developed, it was dried in air overnight Once 
dry, each plate is developed in methanol, dried, and developed in chloroform, each in the same 
direction as the development with EDTA. The washed plates are then dried in air. Just prior to use, 
each plate is activated by heating it to 1 10 °C for 10 minutes. 
5 To prepare the TLC chamber for chromatography, Whatman™ (Clifton, NJ) filter paper is 

cut, for instance into 20 x 80-cm strips, and wrapped around the inside wall of a glass development 
chamber (e.g. y a chamber of 30 x 60 x 10-cm). An appropriate amount (e.g., 100 milliliters for the 
example container) of the desired mobile phase is added to the chamber, and the chamber sealed and 
allowed to equilibrate. Chambers are generally considered equilibrated when the solvent front has 

10 completely ascended the filter paper. 

One representative mobile phase that can be employed for the separation of phospholipid 
classes is a modification of the solvent system described by Holub and Skeaf ("Nutritional regulation 
of cellular phosphatidylinositol" in Meth Enzym., ed. Conn (Academic Press, Inc., Orlando), pp. 
234-243, 1987) consisting of chloroform/methanol/acetic acid/water (100:67:7:4, by vol). For the 

15 separation of neutral lipid classes (total phospholipids (PL), free fatty acids (FFA), free sterols, 

triacylglycerides (TAG), diacylglycerides, monoacylglycerides and cholesterol esters (CE)), a solvent 
system consisting of petroleum ether/diethyl ether/acetic acid (80:20:1, by vol) can be used 
(Mangold, Thin Layer Chromatography- A Laboratory Handbook (Springer- Verlag, New York), 
1969). 

20 After the TLC plates are cooled, sample extracts are spotted onto the activated plate. In 

certain embodiments, samples are spotted at an estimated concentration such that no single lipid class 
will be present at more than 25 \ig per centimeter of plate width following chromatography. This 
helps to ensure that the plate is not overloaded and minimized the risk of cross-contamination 
between lipid classes. (Cross-contamination is readily identified during sample analysis, particularly 

25 were each lipid class contains at least one unique internal standard as described herein.) Lipid class 
separations are performed on TLC plates, for instance with a 10-cm separation length, while PL class 
separations are generally performed on longer TLC plates, for instance with a 20-cm separation 
length. 

Because lipid visualization reagents invariably degrade certain analytes, most notably the 
30 polyunsaturated fatty acids, the identification of individual lipid classes is performed by comparison 
with authentic lipid standards chromatographed in reference lanes. Each reference lane is spotted 
with a mixture of authentic lipid standards (obtained from Avanti Polar Lipids, Alabaster, AL). 
When the amount of sample is not limiting, the sample extract also may be spotted onto the reference 
lanes. 

35 Once the TLC plates are spotted with samples and standards, and the tanks are equilibrated, 

the plates are transferred into the tank containing the selected mobile phase. The samples re 
chromatographed until the mobile phase ascended to 1-cm below the top of the plate. 
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Once the TLC plate is developed, the reference lipids are visualized by cutting the reference 
lanes from the plate, dipping the reference lanes in 10% cupric sulfate/8% phosphoric acid and 
charring the reference lanes at 300 °C. The charred reference lanes are used to identify the location 
of lipid classes on the analytical plate. In order to preserve the quantitative aspect of the sample 
5 analysis, this procedure meets the following criteria: 1) reference standards co-migrate with sample 
analy tes with great accuracy, regardless of the source or composition of the analy tes (for instance, see 
the mirrored control and experimental chromatograms shown in Figure 4B); 2) chromatographic 
separation between the lipid classes is maximized to substantially avoid cross-contamination; and 3) 
the portion of the plate containing analytes is not exposed to environmental stresses such as air, light 
10 or any reagent that would cause the degradation of specific analytes. 

Derivatization: Once the individual lipid classes are separated, the fatty acids are 
hydrolyzed from their respective glycerolipids and prepared for gas chromatography. In one 
particular embodiment, and merely by way of example, each lipid fraction is scraped from the TLC 
plate using a clean razor blade and placed in a 2-mL glass vial or like container. Four-hundred 
15 microliters (400 uL) of 3N methanolic-HCl (Supelco, Bellafonte, PA) are added to each vial, and the 
vials are sealed under nitrogen. The sample vials are incubated at 100 °C for 45 minutes in order to 
trans-methylate the fatty acids. After incubation, the vials are cooled at 4 °C for 20 minutes. 

The resultant fatty acid methyl esters (FAMEs) are extracted from the transmethylation- 
mixture with hexane. For instance, and by way of example, five-hundred microliters (500 uL) of 6% 
K 2 C0 3 (w/v) and 200-uX of hexane, containing 0.05% butylated hydroxytoluene or another 
antioxidant, is added to each vial, and the vials are sealed and mixed on a vortex mixer. The sample 
mixture is then centrifuged at 500 x g to separate the hexane fraction, which contains the FAMEs, 
from the methanol/water fraction. The hexane containing the FAMEs is removed, and for instance 
transferred into 200-uL conical inserts and sealed in 2-mL glass tubes under nitrogen in preparation 
for gas chromatography. Samples may be concentrated by drying under a stream on nitrogen as 
necessary. 

Chromatography Fatty acid methyl esters can be separated and quantified using known 
techniques, for instance by capillary gas chromatography using a Hewlett-Packard (Wilmington, DE) 
6890 gas chromatograph. By way of non-limiting example, analysis may be performed using such a 
gas chromatograph equipped with a 30-m DB-225MS capillary column (J&W Scientific, Folsom, 
CA), and a flame-ionization detector. 

Separation conditions can be determined by one of ordinary skill in the art. Representative 
example conditions are as follows: The injector temperature is set to 270 °C and the detector 
temperature set to 280 °C. The oven temperature is increased from 165 °C to 215 °C at 4.0 °C per 
minute and held at 215° for 12 minutes. The temperature is then increased to 230 °C at 30 °C per 
minute and held at that temperature for three minutes to drive off any high-boiling contaminants. 
Split ratios are maintained at about 40:1. 
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Sterols can be separated and quantified by capillary gas chromatography using a Hewlett- 
Packard (Wilmington, DE) 6890 gas chromatograph equipped with a 30 m DB-35MS capillary 
column (J&W Scientific, Folsom, CA), and a flame-ionization detector. Appropriate example 
separation conditions are as follows: The injector temperature is set to 310 °C and the detector 
5 temperature is set to 280 °C. The oven temperature is increased from 285 °C to 320 °C at 2.5 °C per 
minute. The temperature is then increased to 335 °C at 50 °C per minute to drive off any high- 
boiling contaminants. Split ratios are maintained at about 100: 1 . 

The column and oven conditions may be subject to slight modification over the course of the 
experiment. In particular, modifications may be necessary to ensure that every fatty acid is 
1 0 completely resolved to baseline. 

Sample chromatograms generated using the above methods are shown in Figure 4. 

Optionally, a sample containing known amounts of a set of standard compounds can be run 
through the analysis in like fashion, to produce a control chromatogram. Such a control 
chromatogram is shown in Figure 4B; the constituent standard compounds are indicated. 

15 

B. Integration and Data Handling 

Following chromatography, each chromatogram is integrated, for instance using Hewlett- 
Packard (Wilmington, DE) ChemStation™ software. After chromatogram integration, the 
chromatogram from each sample may be visually checked to ensure proper integration. The resultant 

20 data may be sent electronically to database or spreadsheet for manipulation, for instance an Excel 
2000 (Microsoft Corporation, Redmond, WA) spreadsheet. In some embodiments, the database or 
spreadsheet contains the sample identification information, quality control algorithms, and the 
algorithms required to convert the raw chromatogram data to mass or concentration data. 

Appendices I and II show a single entry in an example database for control and test samples, 

25 respectively. The data structure for this specific database embodiment is discussed in more detail 
below. 



C. Quality Control 

Several quality control protocols can be used in the described methods, to help ensure 

30 accurate, quantitative data from samples. 

The rationally designed internal standards employed by the methods described herein enable 
true quantification of each fatty acid from each lipid class, whereas traditional lipid analysis methods 
produce data in either a percent-of-total format or as a mixed population of lipid metabolites. 
Quantitative analysis of such a mixed population of lipid classes is an analytical impossibility unless 

35 each individual class acts essentially identically at every analytical step. In addition to enabling each 
analysis to be highly quantitative, internal standards are designed to reflect any loss of fatty acid due 
to oxidation, discrimination, or cross-contamination. The results of each sample integration are 
analyzed by an Excel 2000 macro to determine if degradation or selective loss has occurred during 
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the analysis. The macro automatically flags samples with standard profiles deviating by more than 
2% from ideal analytical results for any fatty acid of lipid class. Flagged samples are entirely re- 
analyzed. 

5 X. Integrated Metabolomic Databases 

For metabolomics to develop a global knowledge base analogous to the genome knowledge, 
it is imperative that data be produced and reported in quantitative terms. Typically in the past, 
metabolite data has been reported in a percent-of-total or other relational format. Such data have 
several disadvantages, including that they (1) are influenced by the number of analytes in the tested 
10 sample, (2) are influenced by co-variation between analytes, (3) are not comparable between 

experiments and (4) provide little basis for interpreting how metabolites interact among themselves 
and with other biomolecules. The quantitative data can be integrated from multiple sources (whether 
it is work from different labs, samples from different subjects, or merely samples processed on 
different days) into a single seamless database, regardless of the number of metabolites measured in 
15 each discrete analysis. Thus, abandoning rigorously quantitative methodology in return for high- 
throughput analyses would yield fragmented and non-integratable databases. 

Further embodiments of the disclosure include databases of metabolomic data, where each 
database includes that metabolite quantification data from a plurality of individual lipid metabolite 
profiles. Such databases may be on a computer-readable storage medium, and may be formatted for 
20 processing by a computer. Data included in the databases may include any or all of the following: 
information that provides for unique identification of data from a sample; 
raw quantitative measurements of individual metabolites (such as lipid metabolites); 
transformed measurements of individual metabolites (which have been subject to one or 
more mathematical transformations from raw data); 
25 basic information about the biological sample (e.g., species, tissue, preparation date, etc.); 

genetic information about the subject from which the biological sample was taken (e.g., 
genotype of a knockout or otherwise engineered animal); 

health or care history of the subject from which the sample was taken (e.g., long term care 
strategies, chronic conditions, etc.); 
30 information about the treatment of the subject from which the biological sample was taken 

(e.g., drug application, feeding schedule or diet, stressors, environment, or toxins); 

information about the harvesting of the individual sample and/or the processing of the 

sample; 

information about the individual lipid metabolites (e.g., biochemical or biological 
35 characteristics); 

information about one or more of the implicated metabolic pathways; 
one or more metabolite fingerprints that are associated with a disease, condition, treatment, 
gene (or genotype), or drug application (e.g. 9 to serve as a baseline or control sample); 
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information linking the treated or test samples to their experimental control samples; 
information about the analytical process of producing data; and/or 
information about the laboratory, investigator and analytical chemists responsible for 
producing the data. 

5 The provided databases may serve to organize metabolite information, or any of the other 

information types indicated, in one or more tables. Such tables are readily translatable into database 
languages such as SQL, and the databases optionally can be integrated with an on-line Internet site 
containing results of user-defined metabolite analyses. 

According to one aspect of the present disclosure, a computer-readable storage medium is 
10 provided, with a relational database stored on this medium. The relational database includes a 
metabolite table, for instance containing test metabolite data, which includes a plurality of 
quantitative lipid analysis records. Each record in the table includes data that corresponds to the level 
of a lipid metabolite in the corresponding sample. 

In some embodiments, the relational database includes more than one table, for instance a 
15 control table and a test table. In some embodiments, many tables are included, for instance one each 
for a plurality of the different types of information described above. In some embodiments for 
instance, each lipid class is separated into its own table and the column headers for data are fatty acid 
names. 

In still another embodiment the data (including additional phenotypic or biochemical data) 
20 can be stored in many related tables, with each table representing a subset of the data in its totality. 
For example, consider an experiment in which athletes and non-athletes are assayed for lipid 
metabolite profiles and resting heart rate. One format of the resultant database contains a table for 
each lipid class assayed by the methods described herein, with columnar data including each 
individual fatty acid found in each lipid class, and may also include a related table for phenotypic 
25 information, in this case resting heart rate. In this example, the results obtained from athletes and 
their non-athletic controls can be stored in the same table, or in a separate series of tables. The 
preferred embodiment would allow the two groups to be stored in the same table under unique 
identifying codes such that they could be queried and identified and discriminated as treatment and 
control from a single experiment at a later date. 
30 Filters can be defined for sorting data in the provided databases, in order to mine the data. 

Examples of filter criteria based on the types of fatty acids include the following: 

(1) Fatty Acid Family: In an embodiment using this filter, each fatty acid family is 
a filter criteria. Families may be coded by color. One representative color scheme is as follows: 
Black-"Saturated M ; Maroon-"n7"; Blue- M n9"; Yellow- "Misc."; Green- "n3"; Red- H n6 M ; grey-"Trans"; 

35 Light blue- "Plasmalogen"; 

(2) Summary Data (summarized, for instance, by lipid family, fatty acid family, 
tissue, species, etc*); 



WO 03/005628 



) 

PCT/US02/21426 



-34- 

(3) Major Fatty Acids Only: This filter displays only data from the following Fatty 
Acids: 16:0; 18:0; 16:ln7; 18:ln7; 18:ln9; 18:3n3; 20:5n3; 22:5n3; 22:6n3; 18:2n6; 18:3n6; 20:3n6; 
and 20:4n6. 

The database format and implementation is not essential to certain elements of the 
5 disclosure. It is expected that different end users will require different systems for displaying data 
that are produced by the methods described herein. For instance, a specific requested display feature 
might dictate that the database format described herein be changed. Such modifications in database 
structure are known to one of ordinary skill in the art. 

By way of example, one format is described below. This format is set up for speed 
10 purposes, so that the application does not need to query each value separately from the database. In 
this embodiment, the following information is stored for each control/treatment sample: 

1) A unique auto-incrementing "id" field; 

2) An integer value corresponding to the number of rows of data; 

3) An integer value corresponding to the number of columns per row; 

15 4) A string representing an identifier for the data (the name of the data); and 

5) The data itself, which is stored in row-major order as a comma delimited list of values. 
In this embodiment, being able to correlate two sets of data {e.g., comparing two heart tissue 
samples) is based on the labels matching. This database structure requires only two queries to the 
database before values can be computed, instead of some database formats that require on the order 
20 M*N queries, where M is the number of rows and N is the number of columns per row. 

The following tables (Tables 4 and 5) present MySQL descriptions for specific 
embodiments: 

Table 4: 
25 mysql> describe controls; 



Field 


Type 


Null 


Key 


Default 


Extra 


controlid 


int(ll) 




PRI 


NULL- 


auto_increment 


rows 


int(ll) 


YES 




NULL 




cols 


int(ll) 


YES 




NULL 




name 


varchar(50) 


YES 




NULL 




data 


text 


YES 




NULL 





5 rows in set (0.00 sec) 
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Table 5: 

mysql> describe treatments; 



Field 


Type 


Null 


Key 


Default 


Extra 


treatmentid 


int(ll) 




PRI 


NULL 


auto_increment 


rows 


int(ll) 


YES 




NULL 




cols 


int(ll) 


YES 




NULL 




name 


varchar(50) 


YES 




NULL 




data 


text 


YES 




NULL 





5 rows in set (0.00 sec) 



5 Certain embodiments of the provided databases contain at least two tables (for instance, one for 
controls and one for treatments), though many more tables are also contemplated. 

XL Analysis/Mining of the Database 

The database can be mined by one of many standard statistical techniques. Such techniques 

10 may include standard difference testing between or among subsets of the data selected by the user. In 
certain embodiments, appropriate techniques include tests such as ANOVAs, general linear models 
(GLM), Student's t-tests, discriminant analyses, LOGIT models, etc. For example, if a user wishes to 
identify any specific differences in the lipid metabolites profiles of diabetics when compared to non- 
diabetics, a user may select both individuals from the database that have diabetes and appropriate 

15 non-diabetic controls. To identify the lipid metabolite that best discriminates diabetics from non- 
diabetics; a discriminant analysis can be performed. The results of the discriminant analysis yield a 
single metabolite and the range of biological concentrations of that metabolite that best predicts the 
presence of diabetes. 

A panel or profile of metabolites that predict diabetes can be created by, for instance, the 
20 following two methods, (1) by performing the described analysis repeatedly, and with each iteration, 
removing the discriminated metabolite or (2) by performing a discriminant analysis on summary or 
converted data, where the input values for the discriminant analysis are themselves values calculated 
from quantitative metabolite data, computed from either a random combinatorial approach or from a 
user-defined algorithm. A user defined algorithm can be exemplified by the following: (the sum of 
25 all fatty acids containing a delta-5 double bond) divided by (the sum of all fatty acids not containing a 
delta-5 double bond). 

The database may also be mined by visual tools, such as the "heat map" or targeting charts 
described herein, or by other methods of organizing and visualizing data according to a user-defined 
organization scheme. These methods of organization may include organizing the data by metabolic 
30 pathway, groupings of nutritionally related fatty acids, or the degree of difference between or among 
tested groups of samples. 
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XIL Presentation of the Data 

Presentation of data from the provided databases may be, at least in part, governed by the 
goal(s) of the user. Thus, it is contemplated that views and user interfaces may vary with the specific 
application to which the database is being put, and the specific information the user is mining from 
5 the database. By way of non-limiting example, two specific models of data output and user views are 
provided. These will be referred to herein as the "heat map" model or system, and the "targeting" 
model or system. 

Heat map model: A representative example of a heat map is shown in Figure 5 In a heat 
map display, quantitative metabolite data from a test sample is compared to quantitative metabolite 
10 data from a base line or standard sample (a control) and the increase or decrease in each metabolite is 
indicated on the display, usually in a readily recognizable visual manner. 

The data points can be presented in a two-dimensional layout, such as the chart shown in 
Figure 5, so that the columns contain data from for instance individual fatty acid chains or saturation 
level, while the row are arranged by lipid class, tissue type, species, or any combination thereof. 

15 Other arrangements can easily be envisioned, for instance bar graphs in two or three dimensions, 
which would also enable an overall picture of the data to be displayed. 

By way of example, as shown in Figure 5, the increase or decrease is indicated on the 
display by the color of the relevant block on the chart, and the relative amount of the increase or 
decrease is indicated by the intensity of that color. Thus, in the embodiment pictured in Figure 5, an 

20 increase in the indicated metabolite is colored green, and the brighter the color (the further it is from 
black), the greater the percentage increase. Decreases may be shown in red (of varying intensity). 
Black can be used to indicate that there is no (or relatively little) change in the level of that 
metabolite. A glance at the heat map shows clearly those columns or rows that deviate from the 
standard, because those changes are indicated in a different color. 

25 For instance, in the data location found in the first data column of Figure 5 (labeled 14:0), 

and the first row (the heart sphingomyelin (SP) sample), the test sample contained 80% more of the 
indicated metabolite (14:0 fatty acid, associated with sphingomyelin) than the control sample; the 
relevant block on the heat map is colored bright green, to indicate that the test sample had a relatively 
high increase in the level of this metabolite. 

30 The number of gradations of color can be varied, depending on the sensitivity desired. The 

provided example displays three different intensities of red and green 

Other systems than color can be used to illustrate that there is a change in the amount of a 
metabolite. For each such other system, a key is usually provided. By example, one non-color based 
system would include cross-hatching, stippling, and other "fill patterns" to indicate increases or 

35 decreases in metabolite level. In a three-dimensional depiction, the apparent height of a column 

(upwards or downwards from a given plain) may be indicative of the relative amount of change in the 
metabolite that is depicted by that column. One element of all of these embodiments (including color 
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coding) is that patterns of change can be recognized graphically, without necessary recourse to raw or 
processed data numbers. 

Optionally, the actual percentage increase (or decrease), or the absolute increase (or 
decrease) can be indicated on the heat map. In the provided example, the percentages are given for 
5 those metabolites that differ from the control sample by 10% or more (Figure 5). In alternative 
embodiments, the percentage can appear as a pop-up, for instance when a cursor is passed over the 
relevant location on the chart, or can be accessed by clicking on or otherwise indicating interest in a 
specific location within the chart. Relevant statistical information relating the compared data also can 
be presented in this way. 

10 Data presented as a heat map can be organized in various ways, for instance, by metabolic 

pathway, magnitude, or direction of effect, significance of effect or by a system of categorizing the 
rarity or importance of an effect An example of the importance of an effect is provided in Figure 5, 
which depicts many changes in lipid metabolism as the result of a pharmaceutical intervention (see 
Example 1). The increase in heart cardiolipin concentration is small relative to the increase in many 

15 metabolites, however, this result is rare and important to heart mitochondrial function. One benefit of 
organizing a heat map by tissue/organ or metabolic pathway is that it facilitates identification of 
systems that are strongly affected by the test condition. Similarly, other methods of organization can 
be used to highlight other information in the database. 

In other embodiments, black is used to color the cells (locations in the heat map) 

20 representing metabolites that were not statistically different from each other. The degree of statistical 
significance required before coloring begins can be assigned by the user. In one embodiment, a 
Students t-test statistic can be calculated from the data used for comparison. The user can determine 
the level of significance required for coloring each cell. A standard level of significance would be a 
P-value of less than 0.05, which represents a 95% chance of the difference between the average of the 

25 treatment group and the control being truly different. If the difference between the average of the 
treatment group and the average of the control group has a P-value of less than 0.05, then the 
corresponding cell will be colored according to the degree of difference. 

The user can define the "bin range" for the color scheme. For instance, one user may want 
to set a % difference of 50% to be represented by the maximum color brightness, while another user 

30 may wish to set the maximal difference to be 100%. 

In some of the provided embodiments, the user is able to define the data type for display. 
While the database will contain quantitative data, the display type may be quantitative data (molar), 
quantitative (by mass), or relational by either moles or mass (mole % or weight %, respectively). 
These data types are easily calculated on the fly by the database engine. 

35 The value of the differences in metabolites can be calculated in various ways, for instance as 

a percentage difference, a mean difference, or a percentage or mean difference of transformed data 
between two samples or sample groups. 
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Targeting model: A representative targeting display is shown in Figure 6. In a targeting 
display, quantitative metabolite data is compared from two samples that have been subjected to 
different treatments, for instance treatment with two different drugs or a drug and a test compound. 
The percentage or absolute changes (versus the standard sample) in the measured metabolites are 
5 plotted against each other on a Cartesian graph. This visual system facilitates the comparison of the 
global and individual metabolic effects of the two conditions being examined. If the two treatments 
(e.g., two drug treatments) affect the biological system in metabolically similar ways, the data points 
will fall along a line with a slope of 1, running through the origin (from the lower left to the upper 
right quadrant, in other words). Outlier data points, where one treatment or the other has a different 
10 effect on a metabolite, will lie in the upper left or lower right quadrants, and are therefore easily 
identified. These data points reflect a single metabolite that is increased in one treatment, but 
decreased in the other. The further off the "equivalent" slope a data point is, the greater the 
magnitude of differential effect that is being illustrated. 

This format for data output is particularly useful when two conditions are being directly 
15 compared to each other, with only one or a very few variables are different between the two samples. 
Such conditions may be, for instance, treatment with two known drugs or pharmaceutical agents, or 
with a known drug and a toxin, unknown agent, or potential drug candidate. Other examples include 
a drug treatment compared to a genetic alteration (e.g., a knockout mutation) or a disease state. 
Representative "targeting" applications are described more fully below. 

20 

XIII. Applications 

The metabolite profiles and databases produced therefrom can be used in myriad 
applications, including providing information about individual subjects, about disease states or other 
conditions, about dietary effects, about drug treatments or treatments with drug candidates, about side 

25 effects, and so forth. The provided methods and databases can be used to diagnose, prognose, and/or 
predict disease or other conditions, to monitor drug treatment for efficacy or side effects, to identify 
useful drug targets, to identify potential therapeutic agents with specific metabolic effects, or to 
compare the effects of multiple drugs or other compounds or conditions. Specific examples of 
individual applications are described more fully hereafter. 

30 It is also contemplated that the lipid metabolomic methods and databases described herein 

can be used as clinical diagnostic assays, providing a comprehensive read out of lipid metabolic 
responses to a drug or drug treatment regimen. A clinician can use lipid metabolomic profiles, taken 
before, during, and after drug treatments to determine and track the effectiveness of a drug treatment. 
Metabolomic indicators of successful (or unsuccessful) treatment in many systems are detectable 

35 before other clinical indicators become detectable, and thus this system provides faster and more 
precise characterization of an individual's response to a treatment or treatment regimen. Thus, a 
clinician can examine lipomic data as a way to monitor the efficacy of a particular treatment or 
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dosing strategy, and adjust the treatment earlier than if conventional laboratory indicators are used 
alone. 

The quantitative metabolite data, and methods for acquiring these data, provided herein can 
be used to identify and/or describe the complete metabolic consequences of deleting, over-expressing 
5 or otherwise changing the presence or expression of a gene. Such comparison can be used to identify 
the direct product of some genes, particularly those that are involved in the studied metabolic 
pathways (e.g., pathways of lipid catabolism or anabolism). In some embodiments, this can be used 
to identify the metabolic pathways affected or controlled by said gene. This type of comparison also 
can be used to identify what aspects of metabolism are affected by the downstream consequences of 

10 metabolic pathways controlled by the designated gene. 

Quantitative lipid metabolome data as provided herein can be used as quantitative traits for 
gene mapping. For instance, individual fatty acid types present in single lipid classes or aggregate 
values, such as total number of moles of n-9 fatty acids per gram of plasma, or total moles of 
cardiolipin per gram of tissue, can be correlated with one or more genes. In specific embodiments, 

15 these quantitative traits are the products of an algorithm that relates metabolite values to specific 

genotypic changes, as the quantitative relations among metabolites are often the result of protein gene 
products. 

Quantitative metabolite data, particularly quantitative lipid metabolite data as determined 
using methods described herein, can be used to identify the effects of specific pharmaceuticals, 

20 toxicological agents, or nutritional interventions (or combinations thereof) on lipid metabolism. 

The methods provided herein can be used to identify one or more unknown molecular 
targets of a pharmaceutical, toxin or nutrient, or the metabolic function of a gene, by comparing the 
quantitative measurements of lipid metabolites against a quantitative database of lipid metabolites. 
Such a database contains the quantitative results of trials wherein the effects of genes, 

25 pharmaceuticals, toxins, or nutrients are determined and recorded. One embodiment of this approach 
is depicted in Figure 5. 

This disclosure includes methods for comparison of the metabolic effects of two or more 
pharmaceutical agents, genes, toxins or nutrients by comparing the quantitative results of trials 
determining the quantitative effects of these compounds on lipid metabolites. Such quantitative 

30 effects can be compared by directly comparing the lipid metabolite profiles of samples that are 
different as regards the agent, gene, toxin, or nutrient in question. 

In some embodiments, a drug with a known mode of action is compared with a drug 
candidate whose mode of action is unknown or uncharacterized. Lipid metabolite profile data 
reflecting the effects of the known drug and the drug candidate can be plotted against each other in a 

35 "targeting" model output (such as shown in Figure 6). Comparison of two treatments to each other 
facilitates the identification of compounds that have similar (or dissimilar) effects on the tested 
metabolic system, and thus enables the identification of compounds that are likely candidates as 
therapeutic agents for use in specific systems. 
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For instance, to identify an agent that could be useful in treating diabetes, profiles from 
samples that were treated with possibly active agents (test agents) are compared with one or more 
profiles of samples that were treated with known anti-diabetes agents. Test agents that demonstrate 
similar metabolic effects to known anti-diabetes agents are identified as good candidates for further 
5 characterization. 

Similarly, this system can be used to examine candidate agents for those that have a similar 
therapeutic effect, without one or more undesirable side effects associated with the known therapeutic 
agent. Where a known therapeutic agent is known to affect a specific metabolite (or subset of 
metabolites), a direct compound to compound metabolomic comparison (e.g., presented as a targeting 
10 chart) can be used to identify agents that affect some metabolites in the same manner as the 
therapeutic agent, but that do not affect the "side effect" metabolites in the same manner. 

In specific examples of these embodiments, the biological samples are in vitro cultured cells 
that have been subjected to treatment with different agents that are known to or suspected of having 
biological activity, and/or the characteristic of disturbing or altering the metabolome of cells to which 
15 they are applied. 

One specific embodiment is depicted by Figure 6, wherein the percentage difference in each 
metabolite resulting from treatment is plotted for each of two treatments trials (rosiglitazone and 
CL3 16,243) in a two-dimensional scatter plot (a "targeting" chart). The metabolites found in the 
lower left and the upper right quadrants of the scatter plot (especially those where the X and Y values 
20 are similar) represent those likely affected by similar molecular mechanisms. Metabolites present in 
the upper left or the lower right quadrants of the scatter plot represent the products of different 
molecular affects of the intervention. 

Other targeting chart applications include comparisons between any two conditions, for 
instance drug-toxin versus xenobiotic influence or gene-toxin versus xenobiotic influence. This 
25 method of data visualization can also be used, for instance, to assess the differences caused by 
individual diet component changes, or whole system dietary changes (e.g., omnivorous versus 
vegetarian), and so forth. In essence, any two possible treatments can be compared to each other, and 
using the targeting chart the differences and similarities of metabolic influence can be readily 
determined. 

30 Also contemplated are methods of using the provided databases to test subjects for their 

relation to a metabolic baseline, for instance prior to or following a clinical trial. The metabolic 
profile of a subject (for instance, an individual or a test animal) is determined and compared to a base 
line profile for a similar subject, or a baseline metabolic fingerprint that has been assembled from 
multiple metabolic profiles from a collection of similar subjects. Alternatively, the metabolic profile 

35 of the subject can be compared directly to a previous metabolic profile of that same subject that has 
been determined to be an accepted baseline for that individual subject Differences in the subject's 
profile are indicative of deviations from the baseline. 
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Quantitative relationships that are defined among lipid metabolites using the methods 
described herein can be used to assess the relative activity or function of lipid metabolic enzymes. 
This approach can be used to identify protein targets of pharmaceutical agents, genes, toxins, or 
nutritional components. 

5 Individualized risk assessment and directed metabolite analyses are also contemplated. The 

methods provided herein can be used to profile the lipid metabolites of an individual, which results 
are then compared to a database that contains a plurality of profiles from like and similar individuals. 
The individual can then be provided with, for instance, information regarding likely health risks, 
tendencies to disease or condition, appropriate (or inappropriate) diet, or other information garnered 

10 by comparison to the accumulated metabolomic database. In one specific embodiment, the individual 
profile is compared to subject that have been treated with specific drugs or who have undergone other 
medical treatments, and the likelihood of drug detrimental side effects for the test individual is 
determined. Individual analyses can be used to diagnose specific diseases or conditions that affect 
the metabolic system characterized by the profile and corresponding database. 

15 Lipid metabolomics provides specific information regarding several different diseases or 

other conditions, including for instance organ transplant (e.g., likelihood of rejection, progress of 
acceptance of the donor organ), menopause (and progression through menopause), obesity, diabetes, 
cardiovascular disease, autoimmune conditions, responsiveness to drugs for treatment of each of 
these conditions (including the effectiveness of hormone therapy), and athletic performance or 

20 preparedness. Lipid metabolomic fingerprints can be prepared that provide diagnostic, predictive, 
and or effectiveness characteristics for each of these conditions. 

2VX. Animal Models 

The methods described herein can be used to analyze animal samples and create an animal- 
25 based metabolite database, such as a lipid metabolite database, that can be mined for information. 

The dominant research platform for biotechnology research is the inbred mouse. Such mice 
have constant genomes, making them particularly attractive as laboratory research models. They 
have phenotypes that mirror human diseases, and they have fixed, homozygous genomes. Because 
the genome of each inbred mouse strain is constant, and because the nutrition of captive research 
30 mice can be carefully controlled, phenotypic differences among strains can be attributed directly to 
differences in their genes. 

The medical and pharmaceutical communities use these inbred mouse strains to locate and 
identify the genes responsible for disease and to test the efficacy of new pharmaceutical products. 
Although the locations and sequences of many disease-linked genes have been identified, very few of 
35 these genes have been linked with their metabolic function. Determining the metabolic function of 
genes is critical for validating the gene as a potential target for therapy. The methods provided herein 
provide the necessary link between existing genetic targets and actual metabolic function. 



WO 03/005628 



) 

PCT/US02/21426 



-42- 

Lipid metabolomic profiles are produced for each inbred mouse strain under defined 
laboratory conditions (including, for instance, feeding and watering schedule, temperature, caging, 
and so forth). Profiles can be generated for a plethora of different standard condition sets. These 
profiles then serve as a baseline to which any modification of the strain's genome can be compared. 
5 For instance, a knockout mouse can be generated, which has been rendered defective in a single 
target gene. By comparing the lipid metabolite profile of the knockout mouse (or a set of such 
knockout mice) under defined laboratory conditions, specific metabolic effects of the gene knockout 
can be identified. This comparison can be used to discover, test and validate disease targets identified 
through genomic-, metabolomic-, or and proteomic-based techniques. 

10 Similarly, this comparison technique can be used to examine metabolite changes caused by 

applying a compound to the experimental mouse(or other research animal such as monkeys), for 
instance by feeding the mouse the compound. Thus, drugs and drug candidates can quickly and 
reliably be tested for their metabolic effects. 

By way of example, inbred mice strains can be selected to represent a spectrum of metabolic 

15 disease (normal growth, obesity, lean growth, and diabetes, for instance), and their baseline lipid 

metabolite profiles assembled into a database. This database can be queried by comparing a test lipid 
metabolite profile to it, and determining the similarities and differences. An animal database such as 
the mouse database can also be used to profile the effects of specific pharmaceutical products, for 
instance products that are under public scrutiny or commercial development. 

20 In certain embodiments of the animal lipid metabolite databases, samples are assayed and 

lipid metabolite profiles prepared from multiple tissues from each subject mouse strain. For instance, 
the database may include samples from any tissue, such as one or more than one of the following: 
blood or blood products (such as plasma), heart, adipose (all types), liver, muscle, kidney, spleen, 
lung, testes, and brain. 

25 Examples of the provided databases also may include data from different species, including 

for instance humans, non-human primates, and mice. Comparisons of data and data sets, as well as 
trends or discrepancies in metabolite levels between data from the different species, can provide 
identification of shared or divergent pathways between the species. Comparison of data between 
different species can also be used to study or predict the effects of drugs on the measured metabolites, 

30 for instance in order to predict the effects of a drug in a human system after it has been tested in an 
animal model. 

Other specific uses for animal model databases include drug and other pharmaceutical 
screening, hazard models {e.g., where samples are taken from animals that have been exposed to one 
or more toxins, chemicals, or other hazards), and disease testing (particularly where there is a 
35 recognized model animal system that is useful for gathering comparative data that may be useful for 
correlation with human disease). 
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VX. External Quality Control 

The metabolomic databases described herein can be used to identify biological outliers in 
incoming data. Because certain of the provided databases contain data that defines the biological 
variation in each metabolite across a wide variety of species, tissues and conditions, the cumulative 
5 information base can be used to identify metabolite concentrations that are unusually high or low 

given prescribed criteria. These criteria can be set by the user, and may consist of restricting the data 
used for comparison purposes to species, tissue, treatment, age, etc. 

The invention is further illustrated by the following non-limiting Examples. 

10 

EXAMPLES 

EXAMPLE 1 : Lipid metabolome-wide effects of the peroxisome proliferator- 
activated receptor y agonist rosiglitazone 

15 This example provides specific methods of generating and using quantified metabolite 

profiles to study the effects of a therapeutic compound. 
Samples 

Mouse tissue and plasma samples were a generous donation to Lipomics Technologies from 
Dr. Edward Leiter of the Jackson Laboratory (Bar Harbor, ME). Samples included the plasma, heart, 
20 liver and inguinal adipose of mice treated with pharmaceuticals or their corresponding controls. 

In trial 1, prediabetic male Fl mice (from a cross of the obese NZO and lean NON mouse 
strains) were fed a control diet with or without the presence of the PPARs-y agonist rosiglitzazone for 
4 weeks (at 0.2 g rosiglitazone per kg body weight). 

In trial 2, male, inbred NZO mice were fed a control with or without the presence of the b-3 
25 adenergenic agonist CL3 16,243 for four weeks (at 0.001% CL316,243 by weight in the dietary 
chow). 

In both studies, five treated and five control mice were used. Following the treatments and 
the killing of the mice, tissues and plasma were taken, chilled to -80 °C and shipped to the analysis 
laboratory at Lipomics Technologies in a frozen state. 
30 Extraction 

The lipids from plasma and tissues were extracted in the presence of authentic internal 
standards by the method of Folch et al (J. Biol Chenu 226, 497-509, 1957) by homogenization in a 
fluid extractant consisting of chloroform:methanol (2:1 volrvol). Plasma (200 jd), or 10 mg inguinal 
adipose tissue was used for each analysis. For each sample, an appropriate mass of internal standard 
35 was added. The internal standard compounds chosen may take many forms, but in one specific 
example the internal standards added to each plasma sample were: 1.75 ug of heptadecanoic 1- 
heptadecanoyl-2-lyso-phosphatidycholine (for lysophospholipids), 2.25 micrograms of N- 
pentadecenoyl-D-erythro-sphingosylphorylcholine (for sphingomyelin), 39.93 micrograms of 1,2 
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diheptadecanoylphosphatidylcholine (for phosphatidylcholine), 0.93 micrograms of 1,2- 
diheptadecenoylphosphatidylethanolamine (for phosphatidylethanolamine), 2.09 micrograms of 
pentadecaenoic acid (for free fatty acids), 32.93 micrograms of triheptadecaenoic acid (for 
triacylglycerides), 27.27 micrograms of cholesteryl heptadecanoate (for cholesterol esters) and 38.03 
5 micrograms of stigmasterol (for free sterols). 

For the analysis of liver and heart tissues, 25 mg of tissue were placed in a ground glass 
homogenizer and internal standards were added. The internal standards for use in the analyses of 
these tissues may take many forms, but in this instance consisted of: 4.75 ug of N-pentadecenoyl-D- 
erythro-sphingosylphorylcholine; 74.78 ug of 1,2 diheptadecanoylphosphatidylcholine; 33.57 ug of 
10 1,2-diheptadecenoylphosphatidylserine (for phosphatidylserine); 24.13 ug of 

1,2-diheptadecenoylphosphatidylethanolamine; 13.38 ug of l,r,2,2'-tetraheptadecaenoyl cardiolipin 
(for cardiolipin); 1.12 ug of pentadecaenoic acid; 27.82 ug of triheptadecaenoic acid; 1.56 ug of 
cholesteryl heptadecanoate; and 27.70 ug of stigmasterol. 

The solution mixture consisting of sample, fluid extractant, and internal standards was 
15 homogenized by twelve strokes with a ground-glass homogenizer. Following homogenization, 1.8 
ml of 0.01 M potassium chloride was added, and the solution was vigorously mixed. The organic 
fraction containing the lipids and the internal standards was separated from the polar fraction of the 
. mixture by centrifugation. The lipid extract was removed from the mixture and concentrated under a 
stream of nitrogen in preparation for lipid class separation. 

20 Separation of Lipid and Phospholipid Classes 

The separation of lipid classes was performed by preparative thin-layer chromatography 
(TLC), essentially as previously described (Watkins et al, Lipids 36:247-254, 2001). To remove any 
residual metal or other damaging contaminants on the TLC plates, each plate was washed prior to 
use. Washing the plates is a three-step process that involves impregnating each plate with 

25 ethylenediamine tetraacetic acid (EDTA) and rinsing the plates once with methanol and once with 
chloroform. Each plate is first impregnated with 1 mM EDTA, pH 5.5, by ascending development 
using the method of Ruiz (/. Lipid Res. 38, 1482-1489, 1997). After each plate was completely 
developed, it was dried in air overnight. Once dry, each plate was developed in methanol, dried, and 
developed in chloroform in the same direction as the development with EDTA. The washed plates 

30 were then dried in air. Just prior to use, each plate was activated by heating to 1 10 °C for 10 minutes. 

To prepare the TLC chamber for chromatography, Whatman (Clifton, NJ) filter paper was 
cut into 20 X 80-cm strips and wrapped around the inside wall of a 30 x 60 x 10-cm glass 
development chamber. One hundred milliliters of the appropriate mobile phase was added to the 
chamber, and the chambers were sealed and allowed to equilibrate. Chambers were considered 

35 equilibrated when the solvent front had completely ascended the filter paper. The mobile phase 
employed for the separation of phospholipid classes (lyso-phospholipids, sphingomyelin, 
phosphatidylcholine, phosphatidylserine, phosphatidylethanolamine and cardiolipin) was a 
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modification of the solvent system described by Holub and Skeaf ("Nutritional regulation of cellular 
phosphatidylinositol," mMeth. Enzym., ed. Conn (Academic Press, Inc., Orlando), pp. 234-243, 
1987) consisting of chloroform/methanol/acetic acid/water (100:67:7:4, by vol). 

For the separation of neutral lipid classes (free fatty acids, free sterols, triacylglycerides and 
5 cholesterol esters), a solvent system consisting of petroleum ether/diethyl ether/acetic acid (80:20: 1 , 
by vol) was used (Mangold, Thin Layer Chromatography- A Laboratory Handbook (Springer- Verlag, 
New York), 1969). 

After the TLC plate was activated, the sample extracts were spotted onto the activated plate. 
As a general rule, samples were spotted at an estimated concentration such that no single lipid class 

10 was present at more than 25 u.g per centimeter of plate width following chromatography. This 
ensured that the plate was not overloaded and minimized the risk of cross-contamination between 
lipid classes (cross-contamination is readily identified during sample analysis as each lipid class 
contains unique internal standards). Authentic lipid class standard compounds were spotted on the 
two outside lanes of the thin-layer chromatography plate to enable localization of the sample lipid 

15 classes. 

Lipid class separations were performed on TLC plates with a 10-cm separation length, while 
PL class separations were performed on TLC plates with a 20-cm separation length. Because lipid 
visualization reagents invariably degrade certain ahalytes, most notably the polyunsaturated fatty 
acids, the identification of individual lipid classes was performed by comparison with authentic lipid 

20 standards chromatographed in reference lanes. Each reference lane was spotted with a mixture of 

authentic lipid standards (obtained from Avanti Polar Lipids, Alabaster, AL), and when the amount of 
sample is not limiting, the sample extract was also spotted onto the reference lanes. Once the TLC 
plates were spotted and the tanks were equilibrated, the plates were transferred into the tank 
containing the appropriate mobile phase, and the sample was chromatographed until the mobile phase 

25 ascended to 1-cm below the top of the plate. 

Once the TLC plate is developed, the reference lipids were visualized by cutting the 
reference lanes from the plate, dipping the reference lanes in 10% cupric sulfate/8% phosphoric acid 
and charring the reference lanes at 300 °C. The charred reference lanes were used to identify the 
location of lipid classes on the analytical plate. Each sample was scraped from the plate using a clean 

30 razor blade and the silica scrapings were placed in a 2-mL glass vial for derivitization. Great care 
was taken to develop this process so that it meets the following criteria: 

(1) reference standards co-migrate with sample analytes with great accuracy; 

(2) chromatographic separation between the lipid classes is maximized to avoid any 
cross-contamination problems; and 

35 (3) the portion of the plate containing analytes is not exposed to environmental stresses such 

as air, light or any reagent that would cause the degradation of specific analytes. 

The silica scrapings containing the free sterol fraction were exposed to a fluid extractant 
consisting of one milliliter of chloroform:methanol (2:1 vol/vol). The mixture was mixed vigorously 
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and allowed to sit for 15 minutes, then 0.3 mL of 0.01 M potassium chloride was added, and the 
solution once again mixed vigorously. The organic fraction containing free sterols was separated 
from the polar fraction of the mixture by centrifiigation. The extract including free sterols was 
removed from the mixture and completely dried down under a stream of nitrogen. A 20-uL aliquot of 
5 chloroform was used to transfer the reconstituted free sterols to a conical insert in preparation for free 
sterol separation via capillary gas chromatography. No derivitization was necessary to prepare the 
free sterols for gas chromatographic analysis. 
Derivatization 

Once the individual lipid classes were separated, the fatty acids were hydrolyzed from their 
10 respective glycerolipids and prepared for gas chromatography. Each lipid fraction was scraped from 
the TLC plate using a clean razor blade and placed in a 2-mL glass vial. A 400-pL aliquot of 3N 
methanolic-HCl (Supelco, Bellafonte, PA) was added to each vial, and the vials were sealed under 
nitrogen. The sample vials were incubated at 100 °C for 45 minutes in order to trans-methylate the 
fatty acids. After incubation, the vials were cooled at 4 °C for 20 minutes. The fatty acid methyl 
15 esters were extracted from the transmethylation-mixture with hexane. A 500-uL aliquot of 6% 
K 2 C0 3 (w/v) and 200 \xL of hexane containing 0.05% butylated hydroxy toluene as an antioxidant 
was added to each vial, and the vials were sealed and mixed on a vortex mixer. The sample mixture 
was then centrifuged at 500 x g to separate the hexane fraction, which contained the fatty acid methyl 
esters, from the methanol/water fraction. The hexane containing the fatty acid methyl esters was 
20 transferred into 200-nL conical inserts and sealed in 2-mL glass tubes under nitrogen in preparation 
for gas chromatography. Each sample was concentrated by drying the sample under a stream on 
nitrogen as necessary. 
Chromatography 

Fatty acid methyl esters were separated and quantified by capillary gas chromatography 
25 using a Hewlett-Packard (Wilmington, DE) 6890 gas chromatograph equipped with a 30 m DB- 

225MS capillary column (J&W Scientific, Folsom, CA), and a flame-ionization detector, essentially 
as previously described (Watkins et a/., Lipids 36: 247-2548, 2001). The separation conditions were 
as follows: The injector temperature was set to 270 °C and the detector temperature will be set to 
280 °C. The oven temperature was increased from 165 °C to 215 'C at 4.0 °C per minute and held at 
30 215 °C for 12 minutes. The temperature was then increased to 230 °C at 30 °C per minute and held at 
that temperature for three minutes to drive off any high-boiling contaminants. Split ratios were 
maintained at about 40:1. The column and oven conditions described above are subject to slight 
modification over the course of the experiment because this laboratory requires that every fatty acid 
be completely resolved to baseline for a chromatogram to pass quality control. A sample 
35 chromatogram is provided in Figure 4A. 

Sterols were separated and quantified by capillary gas chromatography using a Hewlett- 
Packard (Wilmington, DE) 6890 gas chromatograph equipped with a 30 m DB-35MS capillary 
column (J&W Scientific, Folsom, CA), and a flame-ionization detector. The separation conditions 
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were as follows: The injector temperature was set to 310 °C and the detector temperature was set to 
280 °C. The oven temperature was increased from 285 °C to 320 °C at 2.5 °C per minute. The 
temperature was then increased to 335 °C at 50 °C per minute to drive off any high-boiling 
contaminants. Split ratios were maintained at about 100:1. The column and oven conditions 
5 described above were subject to slight modification over the course of the experiment because this 
laboratory requires that every sterol be completely resolved to baseline for a chromatogram to pass 
quality control. 

Integration, Data Handling and Visualization 

Following chromatography, each chromatogram was integrated using Hewlett-Packard 

10 (Wilmington, DE) ChemStation™ software. At the beginning of each batch of samples, a standard 
mixture was run, containing a known concentration of each of the fatty acids listed in Table 6, below. 
Each fatty acid in its methyl ester form is present in this standard mixture. The quantitative standard 
was used to set a calibration table that automatically corrected the areas associated with each fatty 
acid methyl ester from the samples for injection discrimination and injector non-linearity. A 

15 representative chromatogram from a standard mixture is shown in the bottom half of Figure 4B. 

Significant differences were assigned to a difference in a lipid metabolite concentration 
between treated and control mice on the basis of Student's f-tests (P< 0.05). 

Quantitative (nmol per g) data were visualized using the Lipomics Surveyor™ software 
system, which creates a "heat-map" graph (Figure 5) of the difference between the data for treated 

20 and control mice. The Surveyor™ data are read as follows: the column headers display the fatty acid 
and the family of fatty acids present in each lipid class, which are in turn described in the row 
headers. The lipid classes are grouped by tissue, and color-coded by metabolic pathway, as depicted 
in Figure 5. The heat map displays an increase in each metabolite in rosiglitazone-treated mice 
relative to control mice as a green square and a decrease in a metabolite as a red square. The 

25 brightness of the square indicates the magnitude of the difference, as detailed in the figure legends. 

RESULTS: 

Metabolomic assessment of plasma lipids 

The results of the quantitative assessment of the plasma lipid metabolome in rosiglitazone- 
30 treated and untreated mice are shown in Figures 7 and 5. Lipid metabolite concentrations in plasma 
confirmed the rosiglitazone-induced depletion of specific classes of plasma lipids. Significant 
rosiglitazone-mediated decreases in phosphatidylcholine, triacylglyceride, and cholesterol ester 
distinguished rosiglitazone-treated mice from untreated mice, whereas no significant decreases in 
sphingomyelin, phosphatidylethanolamine, or free fatty acids were observed (Figure 7). 
35 Phosphatidylcholine, cholesterol ester, and triacylglycerides are derived principally from liver lipid 
export. Total plasma triacylglyceride concentrations were lower in treated mice (400 nmol/g) than in 
untreated mice (1,400 nmol/g) (Figure 7). The concentrations of total plasma free fatty acids, which 
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are derived principally from adipose tissue, were not affected by rosiglitazone treatment Although 
the total concentrations of phosphatidylcholine and cholesterol ester were lower in rosiglitazone- 
treated mice than in untreated mice, the absolute concentration of palmitoleic acid (16:ln7) within 
these lipid classes and within free fatty acids was higher in treated mice than in controls (Figure 5). 
5 The increased palmitoleic acid concentrations in plasma were reflective of the increased de novo 
lipogenesis occurring within the liver and adipose tissue (see below). 
Induction of de novo lipogenesis 

Rosiglitazone-treated mice showed clear signs of increased de novo lipogenesis relative to 
control mice. Every lipid class in liver except sphingomyelin and the free fatty acid, cholesterol ester 

10 and total phospholipids of plasma contained a quantitative increase in palmitoleic acid (16: ln7). 

Additionally, the free fatty acid and triacylglycerides in adipose and every phospholipid class in heart 
contained an increased concentration of 16:ln7. 16:ln7 is the direct biosynthetic product of fatty 
acid synthase (the metabolic pathway for producing fatty acids in vivo) and the A9 desaturase. 
Additionally, this fatty acid was not present in the experimental diet. Hence, the substantial increase 

15 in 16:ln7 present in many liver, plasma, heart and adipose lipid classes (see Figure 5, column header 
"16:ln7") is the direct product of de novo lipogenesis. 

The bright green cross-hatch pattern (horizontal- "liver TAG"; vertical- "16:ln7") combined 
with the clear depletion of triacylglycerides from plasma (bright red line next to "plasma TAG") 
visible in the "heat map" produced from the data (Figure 5) from this study suggests a dual cause for 

20 the known accumulation of lipid in the livers of rosiglitazone-treated mice. First, it is clear that the 
rosiglitazone treatment caused a decrease in triacylglyceride mobilization from the liver into plasma. 
This result is confirmed by data acquired by Dr. Edward Leiter of the Jackson Laboratory, which 
demonstrated an increase in the expression of genes involved in the retention of lipid by the liver. 
Second, the increased liver lipid content resulting from the lack of triacylglyceride mobilization is 

25 compounded by an increased de novo synthesis of lipid as described above. 
Liver lipid metabolism 

The results of the quantitative assessment of the liver lipid metabolome in rosiglitazone- 
treated and untreated mice are shown in Figures 7 and 5. Lipid metabolites in the liver demonstrated 
a reciprocal relation between liver and plasma lipid concentrations. The significant rosiglitazone- 

30 mediated decreases in plasma triacylglycerides were balanced by a substantial accumulation of 
triacylglycerides within the liver (Figures 7 and 5). Total hepatic triacylglycerides were 81,300 
nmol/g in untreated mice and 150,400 nmol/g in the rosiglitazone-treated mice. The concentrations 
of other lipid classes were not affected by rosiglitazone treatment with the exception of 
sphingomyelin, which was present at 1,180 nmol/g in treated mice and at 1,890 nmol/g in untreated 

35 control mice (Figure 7). This rosiglitazone-induced reciprocity between liver and plasma 

triacylglycerides is consistent with an inhibition of normal liver-plasma lipid exchange. No change 
was observed in the total concentration of phosphatidylcholine or cholesterol ester in liver as a 
consequence of rosiglitazone treatment (Figure 7). 
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Inhibition of peroxisomal lipid metabolism 

Two major types of lipids quantified in this study are derived from biosynthetic pathways • 
present in the peroxisome. The fatty acids with three double bonds on the carboxylic acid side of an 
n-9 double bond (22:5n6 and 22:6n3) are synthesized by retroconversion from their biosynthetic 
5 precursors (24:5n6 and 24:6n3, respectively) in the peroxisome. The plasmalogen lipids, those lipids 
that contain one or more 1-enyl-ether-linked fatty acids, are also derived from biosynthetic pathways 
present in the peroxisome. In hearts from mice treated with rosiglitazone there was a substantial 
decrease in the 22:6n3 content of all phospholipid classes except sphingomyelin, as well as in free 
fatty acids and cholesterol esters, relative to control mice. Additionally, there was a significant 

10 depletion of 1-enyl-ether linked fatty acids from the heart phospholipids of rosiglitazone-treated mice 
relative to control mice. These observations are easily detectable in Figure 5, which portrays this data 
in the described "heat map" format. Each of these observations suggests that rosiglitazone, a known 
PPARs-y agonist, has an inhibitory effect on lipid synthesis in the peroxisome. 
Heart lipid class metabolism 

15 The results of the quantitative assessment of the heart lipid metabolome in rosiglitazone- 

treated and untreated mice are shown in Figures 7 and 5. Free fatty acids are the primary source of 
energy for the heart. The average concentration of total free fatty acids in the heart was 5,100 
nmol/gin untreated mice and 2,500 nrnol/g in rosiglitazone-treated mice (Figure 7). This difference 
was largely independent of the type of free fatty acid, as the saturated n-3, n-6, and n-9 families of 

20 fatty acids were all approximately 50% lower in treated mice than in untreated mice (Figure 7). The 
free n-7 fatty acids were not depleted as substantially from heart, likely due to the increased 
biosynthesis of n-7 fatty acids and corresponding increased concentration of n-7 fatty acids within the 
rriacylglycerides and free fatty acids of plasma. 

The hearts of rosiglitazone-treated mice were significantly enriched with cardiolipin, the 

25 primary structural lipid of the inner mitochondrial membrane. The mean cardiolipin content of hearts 
from rosiglitazone-treated mice was 3,000 nmol/g as compared with 2,500 nmol/g in untreated mice. 
Unlike free fatty acids, the fatty acid components of cardiolipin were differentially modulated by 
rosiglitazone treatment. The primary fatty acid of cardiolipin, linoleic acid (18:2n6), was 4,550 
nmol/g in control heart cardiolipin and 8,850 nmol/g in heart cardiolipin of rosiglitazone-treated 

30 mice. Docosahexaenoic acid (22:6n3) was depleted from cardiolipin in the hearts of treated mice 
(950 nmol/g) relative to hearts of control mice (2,200 nmol/g). 

The plasmalogen lipids, those lipids that contain 1-enyl-ether-linked alkyl chains, are 
derived from the dihydroxyacetone phosphate pathway and are partially synthesized within the 
peroxisome. The concentration of plasmalogens was lower in the heart phospholipids of mice treated 

35 with rosiglitazone than of controls (Figure 5). These data are consistent with a decreased peroxisomal 
synthesis of lipids within the hearts of treated mice. 
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Adipose lipid class metabolism 

The results of the quantitative assessment of the inguinal adipose lipid metabolome in 
rosiglitazone-treated and untreated mice are shown in Figures 7 and 5. Inguinal fat tissue from 
rosiglitazone-treated mice displayed a 5.7% lower triacylglyceride content (9,628 umol/g) than 
5 inguinal adipose from controls (1,019 umol/g), and 35% more free fatty acids (13,370 nmol/g in 
treated mice and 9,900 nmol/g in controls). No significant differences in total phospholipid or 
cholesterol ester concentrations were observed (Figure 7). 

The fatty acid composition of inguinal fat triacylglycerides was substantially altered by 
rosiglitazone treatment, with inguinal fat from treated mice accumulating fatty acids from the 

10 saturated n-7 and n-3 families of fatty acids, while being depleted of the n-9 family of fatty acids 
(Figure 7). In particular, an unusual accumulation of n-3 fatty acids was observed in inguinal fat 
from rosiglitazone-treated animals. The concentration of total n-3 fatty acids in the inguinal fat 
triacylglycerides of treated mice was 71,260 nmol/g, representing a 120% greater concentration than 
that in untreated mice (Figure 7). The most notable increases within the n-3 family of fatty acids 

15 were a 522% greater concentration (4,100 nmol/g) of eicosapentaenoic acid, a 612% greater 

concentration (7,000 nmol/g) of docosahexaenoic acid, and 84% (24,300 nmol/g) more a-linolenic 
acid in inguinal fat triacylglycerides in treated as compared with control mice (Figure 5). The 
concentration of n-7 fatty acids in inguinal fat triacylglycerides was 303 umol/g in treated mice and 
204 umol/g in untreated controls (Figure 7). In contrast, the total concentration of n-6 fatty acids was 

20 less than 3% higher. However, the accumulation or depletion of individual fatty acids within the n-6 
family varied substantially. Whereas linoleic acid (18:2n6), by far the most prominent n-6 fatty acid 
in inguinal fat, was not significantly altered by treatment, the concentrations of y-linolenic, dihomo-y- 
linolenic, and arachidonic acids in inguinal fat were respectively, 1,225 nmol/g (78 %), 1,300 nmol/g 
(64 %), and 3,800 nmol/g (276 %) greater in treated mice than in untreated controls (Figure 5). 

25 The concentration of plasmalogen lipids in inguinal fat phospholipids was depleted by 

rosiglitazone treatment (Figure 7). The concentration of total plasmalogen fatty acids from the 
phospholipids of inguinal fat was 130 nmol/g (60%) less in treated mice than untreated controls. 
Differential effects on individual organs 

It is clear from Figure 5 that the effect of rosiglitazone is variable on different tissues, and 

30 that a complete metabolomic assessment, including the measurement of both fatty acids and lipid 
classes from several tissues is important for understanding the true effects of rosiglitazone on lipid 
metabolism. 

Comparison of the results from trial 1 and trial 2. 

CL3 16,243 is a p-adenergenic receptor agonist that is also used to lower plasma glucose 
35 concentrations in diabetic model systems. The effect of CL3 16,243 on plasma total triacylglycerides 
is known to be similar to that of rosiglitazone. The similarity of effect between these two compounds 
on plasma lipids is largely validated by lipomic analysis results. This can be observed visually when 
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the quantitative results from a complete raetabolomic assessment of both rosiglitazone and 
CL316,243 are plotted together in a "targeting" graph (Figure 6). 

However, there a several metabolites that cluster in the upper left quadrant of the targeting 
graph, including cholesterol ester 16:ln7, triacylglyceride 16:ln7, cholesterol ester 18:ln7, 
5 phosphatidylcholine 16: 17 and cholesterol ester 20:3n9. Each of these fatty acids can be produced de 
novo in animals, and none of these fatty acids were prevalent in the experimental diet. Hence, the 
cluster of metabolites present in the upper left quadrant of the targeting graph clearly show an 
increase in the de novo synthesis of fatty acids resulting from rosiglitazone treatment, and that this 
increase was not induced by CL316,243. These results are consistent with the lipomic findings in 
10 liver (discussed above), and demonstrate the utility of a targeting graph for identifying the differences 
in metabolic response to two individual affectors. 

Discussion 

Rosiglitazone treatment is often accompanied by weight gain in humans, an effect strikingly 
15 reflected by the rosiglitazone-induced increase in body weight of already markedly-obese (NZO x 
NON)Fl male mice. In this study, the potent anti-hyperglycemic effect of rosiglitazone was 
accompanied by an increased de novo synthesis of fatty acids. Paimitoleic acid (16:ln7) and 
vaccenic acid (18:ln7) were excellent metabolic indicators of the increased de novo synthesis of fatty 
acids, and the effect appeared to be mediated by an increased expression of fatty acid synthase within 
20 in the liver. This increased synthesis of fatty acids is likely a key metabolic explanation for both the 
weight gain and the severe hepatic steatosis observed in the rosiglitazone-treated animals. 
Interestingly, although lipid biosynthesis was increased, the increase in liver triacylglyceride 
concentration was not reflected in the plasma. Thus, there is a strong indication that normal lipid 
import-export activities between the liver and plasma were impaired by rosiglitazone treatment, and 
25 that this dysregulation and increased biosynthesis of lipids may be mutually responsible for the 
hepatic steatosis. 

Because rosiglitazone decreased the concentrations of plasma lipids as classes of molecules 
(i.e, triacylglycerides, cholesterol esters, etc.), standard clinical markers of lipid metabolism did not 
reflect the increased hepatic de novo lipogenesis in response to rosiglitazone treatment. In contrast, 

30 the metabolomic assessment of plasma lipids identified several markers of increased liver 
lipogenesis, including an increased absolute concentration of 16:ln7 and 18:In7 in plasma 
cholesteryl esters, phosphatidylcholine, and triacylglycerides, despite the decrease in the 
concentration of total plasma lipid classes. The metabolomic analysis of the plasma alone was 
therefore capable of making the important discrimination between hypolipidemia caused by 

35 decreased lipid synthesis compared with hypolipidemia caused by impaired export of lipid by the 
liver. These data suggest that metabolomic analyses of human plasma have strong potential as 
clinical diagnostics. Further demonstrating the strong relations between the plasma lipid metabolome 
and tissue metabolism were the decreased concentration of plasmalogen lipids in plasma and the 
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similarity between the composition of the plasma lipid metabolome and liver and adipose 
metabolomes. 

Heart lipid metabolism was strongly influenced by rosiglitazone treatment. In particular, 
heart free fatty acids, cardiolipin, plasmalogen lipids, and the important polyunsaturated fatty acids 
5 22:6n3 and 18:2n6 were significantly modulated by treatment Some of these changes, particularly 
those involving the concentration and composition of cardiolipin and free fatty acids, may in part 
represent the alterations in muscle metabolism that improve insulin sensitivity. Cardiolipin is an 
essential phospholipid for energy metabolism and the primary phospholipid of the inner 
mitochondrial membrane. The content and composition of cardiolipin are important to the efficiency 

10 of electron transport. Rosiglitazone caused an increase in heart cardiolipin concentration and a 
substantial remodeling of cardiolipin toward an elevated 18:2n6 content and a diminished 22:6n3 
content. Interestingly, this is precisely the change in cardiolipin content and composition that would 
increase electron transport efficiency and decrease electron leakage, according to the existing in vitro 
data. Rosiglitazone-induced remission from hyperglycemia in combination with reduced plasma 

15 insulin concentrations indicated that glucose oxidation by tissues was increased by this insulin- 
sensitizing agent. Thus, it is possible that increased energy metabolism as well as decreased plasma 
lipids may have caused the decreased heart free fatty acid concentrations. 

Two major types of lipids quantified in this study are synthesized at least in part, within the 
peroxisome. These are the fatty acids with three double bonds on the carboxylic acid side of an n9 

20 double bond (22:5n6 and 22:6n3) (Moore et al, J. Lipid Res., 36:2433-2443, 1995; Sprecher et al, J. 
Lipid Res., 36:2471-2477, 1995; Voss et al., J. Biol. Chem. 266:19995-20000, 1991), and the 
plasmalogen lipids, which are synthesized by the dihydroxyacetone phosphate biosynthetic pathway 
» (Nagan & Zoeller, Prob. in Lipid Res., 40: 199-229, 2001). Heart tissue from rosiglitazone-treated 
mice contained significantly less 22:6n3 in phosphatidylcholine, phosphatidylethanolamine, 

25 cardiolipin, phosphatidylserine/inositol, free fatty acids, and cholesterol esters than did heart from 

untreated control mice. Additionally, there was a significant depletion of plasmalogen lipids from the 
heart phospholipids of treated mice relative to untreated controls. These observations suggest that 
rosiglitazone, a known PPARy agonist, has an inhibitory effect on lipid biosynthesis in the 
peroxisome. The decreased production of 22:6n3 and plasmalogen lipid may have important 

30 physiologic consequences. Dietary 22:6n3 has well-documented positive effects on cardiac function, 
and plasmalogen lipids have recently been shown to be essential to membrane trafficking and the 
structure of caveolae, clathrin-coated pits, endoplasmic reticulum, and Golgi cisternae. 

A curious finding in this study was the inguinal fat tissue accumulation of polyunsaturated 
fatty acids in response to rosiglitazone. Accumulation of 22:6n3 and other long-chain 

35 polyunsaturated fatty acids likely occurs via a pathway independent of their biosynthesis de novo 
from precursors. The conversion of polyunsaturated-rich phospholipids to triacylglycerides via a 
phospholipase D pathway also does not appear to be the primary metabolic basis for the enrichment 
with polyunsaturates, as phospholipids were also enriched with polyunsaturated fatty acids. This 
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unusual response may be an important clue to understanding the physiology of adipose tissue 
activated by PPARy agonists, and should be investigated further. 

The present study utilized a diabetic mouse model in which the anti-diabetic action of a TZD 
was accompanied by excessive weight gain and major alterations in the lipid metabolome. Its major 
5 findings were that rosiglitazone (j) induced hypolipidemia by disrupting the mobilization of liver 
lipids into plasma, (if) induced de novo fatty acid synthesis, (Hi) diminished the biosynthesis of lipid 
synthesized within the peroxisome, (iv) had substantial effects on heart cardiolipin and free fatty acid 
metabolism, and (v) exerted tissue-specific effects on lipid metabolism. 

The results presented above clearly demonstrate that metabolomic data can be obtained, 
10 stored, visualized, and analyzed using methods provided herein. 

EXAMPLE 2: Disease/Condition-Linked Lipid Metabolite Profiles (Fingerprints) 

With the provision herein of methods for determining the quantitative levels of a 
comprehensive panel of lipid metabolites, and the ability to assemble such individual metabolite 
15 profiles into a minable database, disease- or condition-linked lipid metabolite profiles (which provide 
information on the disease or condition state of a subject) are now enabled. 

Disease or condition linked lipid metabolite profiles comprise the distinct and identifiable 
pattern of levels of lipid metabolites, for instance a pattern of high and low levels of a defined set of 
metabolites or subset of like or unlike metabolites, or molecules that can be correlated to such 
20 metabolites (such as biosynthetic or degradative enzymes that affect such metabolites). The set of 
molecules in a particular profile usually will include at least one of those listed in Table 6. 



Table 6: 



SCIENTIFIC NAME 


SCIENTIFI 


COMMON NAME 




CABBR. 




! -SATURATED 




Tetradecanoic Acid 


14:0 


Myristtc Acid 


Pentadecanoic Acid 


15:0 




Hexadecanoic Acid 


16:0 


Palmitic Acid 


Heptadecanoic Acid 


17:0 


Margaric Acid 


Octadecanoic Acid 


18:0 


Stearic Acid 


Eicosanoic Acid 


20:0 


Arachidic Acid 


Docosanoic Acid 


22:0 


Behenic Acid 


Tetracosanoic Acid 


24:0 


Lignoceric Acid 


- D9DESATURASE FAMILY - 


9-Tetradecenoic Acid 


14:ln5 


Myristoleic Acid 


9-Hexadecenoic Acid 


16:ln7 


Palmitoleic Acid 


11-Octadecenoic Acid 


18:ln7 


Vaccenic Acid 


9-Octadecenoic Acid 


18:ln9 


Oleic Acid 


1 1-Eicosenoic Acid 


20:ln9 


Eicosenoic Acid 


5,8,1 1-Eicosatrienoic Acid 


20:3n9 


Mead Acid 


13-Docosenoic Acid 


22:ln9 


Erucic Acid 


15-Tetracosenoic Acid 


24:ln9 


Nervonic Acid 
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Table 6 (cont): 



SCIENTIFIC NAME 


SCIENTIFI 


COMMON NAME 




CABBR. 




- OMEGA 3 FAMILY - 


9,12,15-Octadecatrienoic Acid 


18:3n3 


a-Linolenic Acid 


6,9,12,15-Octadecatetraenoic Acid 


18:4n3 




11,14,17-Eicosatrienoic Acid 


20:3n3 


Eicosatrienoic Acid (ETA) 


8,11,14,17-Eicosictetraenoic Acid 


20:4n3 




5,8,11, 14,17-Eicosapentaenoic Acid 


20:5n3 


Eicosapentaenoic Acid (EPA) 


7,10,13,1 6, 1 9-Docosapentaenoic Acid 


22:5n3 


Docosapentaenoic Acid 






(DPA) 


4,7, 1 0, 1 3, 1 6, 19-Docosahexaenoic Acid 


22:6n3 


Docosahexaenoic Acid (DBA) 


6,9,12,15,18,21-Tetracoshexaenoic 


24:6n3 


Tetracosahexaenoic Acid 


Acid 






! - OMEGA 6 FAMILY - 


9,12-Octadecadienoic Acid 


18:2n6 


Linoleic Acid 


6,9, 1 2-Octadecatrienoic Acid 


18:3n6 


g-Linolenic Acid 


1 1,14-Eicosadienoic Acid 


20:2n6 


Eicosadienoic Acid 


8,11,14-Eicosatrienoic Acid 


20:3n6 


Homo-g-Linolenic Acid 


5,8, 1 1 ,14-Eicosicatetraenoic Acid 


20:4n6 


Arachidonic Acid 


13,16-Docsadienoic Acid 


22:2n6 


Docosadienoic Acid 


7,10,13,16-Docosicatetraenoic Acid 


22:4n6 


Docosicatetraeiioic Acid 


4,7, 10, 1 3, 1 6-Docosapentaenoic Acid 


22:5n6 


Docosapentaenoic Acid 


1 - UNUSUAL FAMEs - 


9-Trans-Hexadecenoic Acid 


tl6:ln7 


Palmitelaidic Acid 


9-Trans-Octadecenoic Acid 


tl8:ln9 


Elaidic Acid 


8-Eicosaenoic Acid 


20:lnl2 




5-Eicosaenoic Acid 


20:lnl5 




Plasmalogen fatty acids 


16:0 




» 


18:0 




it 


18:ln7 






18:ln9 




-STEROLS- 


5b-cholestan-3b-ol 


C27H48O 


coprostanol 


5a-cholestan~3b-ol 


C27H48O 


dihydrocholesterol 


5-cholesten-3b-ol 


C27H46O 


cholesterol 


5,24-cholestadien-3b-ol 


C27H44O 


desmosterol 


5-cholestan-25a-methyl-3b-ol 


C28H42O 


campesterol 


5-cholestan-24b-methyl-3b-ol 


C28H42O 


dihydrobrassicasterol 


5-cholesten-24b-ethyl-3b-ol 


C29H50O 


b-sitosteroi 


5,22-cholestadien-24b-ethyI-3b-ol 


C29H48O 


stigmasterol 



By way of example, any subset of the metabolites listed in Table 6 may be included in a 
single lipid metabolite profile. Specific examples of such subsets include those metabolites (1) that 
5 are linked by a biosynthetic or biodegradative pathway, (2) that are precursors or products of each 
other, and so forth. Alternatively, some subsets include those metabolites that show an increasing 
level during progression of a disease or condition such as diabetes, obesity, heart disease, coronary 
artery disease, liver disease, menopause, pregnancy, or hyper- or hypothyroidism; those that show a 
decreasing level; those that are most highly correlated to a particular stage or progression of a 
10 specified disease or condition, and so forth. Alternatively, lipid metabolite profiles may be further 
broken down by the tissue from which metabolites were harvested for the profile. Thus, certain 
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examples of profiles may include a specific class of lipid metabolites that are found only in, or are 
found only to be affected in, a specific tissue, such as heart, nerve (such as brain), liver, adipose, 
connective, or other tissue. In some instances, selection of such tissue-specific profiles may be 
guided by existing knowledge that that tissue (or those tissues) is involved in the disease or condition 
5 under study. 

Particular metabolite profiles are specific for a particular stage of normal tissue (e.g. 9 normal 
heart tissue), a particular nutritional state {e.g., growth on a particular diet), a particular condition or 
disease {e.g., diabetes), or a disease or condition progression {e.g., progression of menopause, for 
instance as a set of profiles from a single subject over a period of time prior to, during, and after onset 

10 of menopause). Each profile includes information on the level of a set of lipid metabolites that are 
linked to the disease or condition being studied {e.g., menopause-progression linked metabolites). 
Such information usually includes absolute levels of specific metabolites, and may similarly include 
the levels of a class (or classes) of metabolites that are linked by a biochemical pathway, or 
metabolites that are otherwise biochemically related to each other. Results from the lipid metabolite 

15 profiles of an individual subject are often viewed in the context of a test sample compared to a 
baseline or control sample profile, or a known profile compiled from a database of individual 
profiles. 

The levels of lipid metabolites that make up a lipid metabolite profile can be measured in 
any of various known ways, including specifically those methods described herein. In particular, it is 
20 contemplated that any method that can be used to generate a quantitative measurement of individual 
metabolites, particularly a chromatographic method, can be used to generate data for use in the 
described lipid metabolite profiles. 

EXAMPLE 3: Identification of Compounds 

25 The linkage of specific lipid metabolites, or sets of lipid metabolites, and the levels thereof 

(for instance, as shown in a lipid metabolite profile), to a disease, condition, or predilection of an 
individual to suffer from or progress in a disease or condition, can be used to identify compounds that 
are useful in treating, reducing, or preventing that disease or condition, or development or 
progression of the disease or condition. 

30 By way of example, a test compound is applied to a cell, for instance a test cell, and a lipid 

metabolite profile is generated and compared to the equivalent measurements from a test cell that was 
not so treated (or from the same cell prior to application of the test compound). Similarly, in some 
embodiments, the test compound is applied to a test organism, such as a mouse. If application of the 
compound alters level(s) of one or more lipid metabolites (for instance by increasing or decreasing 

35 that level), or changes the lipid metabolite profile, then that compound is selected as a candidate for 
further characterization. 

Control lipid metabolite profiles useful for comparison in such methods may be constructed 
from, for instance, normal tissue or cells, tissue or cells taken from a subject known to suffer from the 
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target disease/condition or a specific stage of that disease/condition, tissue or cells that have been or 
are being subject to a treatment for that disease or condition, and/or a tissue or cells taken from a 
subject known to suffer from a different disease/condition or stage thereof. In the latter example, the 
different disease/condition may be a disease or condition that is known to affect a similar set or 
5 subset of lipid metabolites, known to be influenced by similar drugs or treatments, or is not related to 
the target disease/condition with any currently identified correlation. 

This invention provides methods for generating metabolite profiles, particularly lipid 
metabolite profiles, and assembling such profiles into consistent, comparable, minable metabolomic 

10 databases. The invention further provides methods for mining metabolomic databases in order to 
identify and understand metabolome-wide effects, for instance those effects influenced by 
pharmaceuticals, genes, toxins, diet, or the environment Also provided are databases, means for 
accessing and mining such databases, and systems for such. It will be apparent that the precise 
details of the methods described may be varied or modified without departing from the spirit of the 

15 described invention. We claim all such modifications and variations that fall within the scope and 
spirit of the claims below. 



n 

WO 03/005628 



PCT/US02/21426 



-57- 
CLAIMS 

We claim: 

5 1 . A method of generating a database comprising: 

obtaining a plurality of quantified metabolic profile from a plurality of biological samples, 
wherein each quantified metabolic profile comprises a quantified measurement of a metabolite and 
wherein the quantified measurement is obtained using an internal standard for the metabolite so that 
the quantified measurement is integratable to a database, and 
10 assembling the plurality of metabolite profiles into a database. 

2, The method of claim 1, wherein each quantified metabolic profile comprises a 
quantified measurement of a plurality of metabolites. 

3. The method of claim 1, wherein at least one of the biological samples comprises a 
biological fluid or tissue sample. 

15 4. The method of claim 1 , wherein the biological samples are selected from individual 

subjects or archival repositories or some of both. 

5. The method of claim 1, wherein at least one of the biological samples is obtained 
from an animal model. 

6. The method of claim 1, wherein at least one of the biological samples is taken from 
20 an animal. 

7. The method of claim 6, wherein the animal is a human, mouse, rat, monkey, 
chicken, cat, dog, pig, horse, cow, fruit fly, or worm.. 

8. The method of claim 6, wherein the animal is a mammal. 

9. The method of claim 1, wherein the biological sample is obtained from an organism 
25 that is subject to a condition. 

10. The method of claim 9, wherein the condition is a trait of the organism from which 
the biological sample is obtained. 

1 1. The method of claim 9, wherein the condition is a genotype. 

12. The method of claim 1 1, wherein the genotype comprises a genetic knockout. 
30 13. The method of claim 9, wherein the condition comprises a dietary limitation or 

supplementation. 

14. The method of claim 9, wherein the condition comprises a disease or disease state. 

15. The method of claim 9, wherein the condition comprises application of a toxin or 
suspected toxin. 

35 16. The method of claim 9, wherein the condition comprises application of a 

pharmaceutical or therapeutic agent or candidate agent to the organism. 

17. The method of claim 9, wherein the condition comprises an increase in exercise, a 
decrease in exercise, or a change in an exercise regimen of the subject. 
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18. The method of claim 1, wherein the metabolite is selected from the group 
consisting of lipids, carbohydrates, vitamins, co- factors, and pigments. 

19. The method of claim 1, wherein the metabolite is a lipid metabolite. 

20. The method of claim 19, wherein the lipid metabolite is selected from the group 
5 consisting of tetradecanoic acid, pentadecanoic acid, hexadecanoic acid, heptadecanoic acid, 

octadecanoic acid, eicosanoic acid, docosanoic acid, tetracosanoic acid, 9-tetradecenoic acid, 9- 
hexadecenoic acid, 11-octadecenoic acid, 9-octadecenoic acid, 1 1-eicosenoic acid, 5,8,11- 
eicosatrienoic acid, 13-docosenoic acid, 15-tetracosenoic acid, 9,12,15-octadecatrienoic acid, 
6,9,12,15-octadecatetraenoic acid, 11,14,17-eicosatrienoic acid, 8,11,14,17-eicosictetraenoic acid, 

10 5,8,1 1,14,17-eicosapentaenoic acid, 7,10,13,16,19-docosapentaenoic acid, 4,7,10,13,16,19- 

docosahexaenoic acid, 6,9,12,15,18,21-tetracoshexaenoic acid, 9,12-octadecadienoic acid, 6,9,12- 
octadecatrienoic acid, 11,14-eicosadienoic acid, 8,11,14-eicosatrienoic acid, 5,8,11,14- 
eicosicatetraenoic acid, 13,16-docsadienoic acid, 7,10,13,16-docosicatetraenoic acid, 4,7,10,13,16- 
docosapentaenoic acid, 9-trans-hexadecenoic acid, 9-trans-octadecenoic acid, 8-eicosaenoic acid, 5- 

15 eicosaenoic acid, plasmalogen fatty acids, 5b-cholestan-3b-ol, 5a-cholestan-3b-ol, 5-cholesten-3b-oI, 
5,24-cholestadien-3b-ol,5-cholestan-25a-methyl-3b-ol,5-cholestan-24b-methyl-3b-ol, 5-cholesten- 
24b-ethyl-3b-ol, and 5,22-cholestadien-24b-ethyl-3b-ol, each as a compound or a component of a 
lipid molecule. 

21. The method of claim 2, wherein the plurality of metabolites comprise two or more 
20 lipid metabolites, and one or more of the lipid metabolites are selected from the group consisting of 

tetradecanoic acid, pentadecanoic acid, hexadecanoic acid, heptadecanoic acid, octadecanoic acid, 
eicosanoic acid, docosanoic acid, tetracosanoic acid, 9-tetradecenoic acid, 9-hexadecenoic acid, 11- 
octadecenoic acid, 9-octadecenoic acid, 1 1-eicosenoic acid, 5,8,1 1-eicosatrienoic acid, 13-docosenoic 
acid, 15-tetracosenoic acid, 9,12,15-octadecatrienoic acid, 6,9,12,15-octadecatetraenoic acid, 

25 1 1,14,17-eicosatrienoic acid, 8,1 1,14, 17-eicosictetraenoic acid, 5,8,1 1,14,17-eicosapentaenoic acid, 
7,10,13,16,19-docosapentaenoic acid, 4,7,10,13,16,19-docosahexaenoic acid, 6,9,12,15,18,21- 
tetracoshexaenoic acid, 9,12-octadecadienoic acid, 6,9,12-octadecatrienoic acid, 11,14-eicosadienoic 
acid, 8,11,14-eicosatrienoic acid, 5,8,1 1,14-eicosicatetraenoic acid, 13,16-docsadienoic acid, 
7,10,13,16-docosicatetraenoic acid, 4,7,10,13,16-docosapentaenoic acid, 9-trans-hexadecenoic acid, 

30 9-trans-octadecenoic acid, 8-eicosaenoic acid, 5-eicosaenoic acid, plasmalogen fatty acids, 5b- 
cholestan-3b-ol, 5a-cholestan-3b-ol, 5-cholesten-3b-ol, 5,24-choIestadien-3b-ol, 5-cholestan-25a- 
methyl-3b-ol, 5-cholestan-24b-methyl-3b-ol, 5-cholesten-24b-ethyl-3b-ol, and 5,22-cholestadien- 
24b-ethyl-3b-ol, each as a compound or a component of a lipid molecule. 

22. The method of claim 19, wherein the quantified metabolic profile comprises a 
35 quantified measurement of a lipid in a lipid class. 

23. The method of claim 22, wherein the quantified measurement of the lipid in the 
lipid class is obtained using an internal standard for the lipid class. 
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24. The method of claim 22, wherein the lipid is selected from the group consisting of 
fatty acid 16:0, 18:0, 16:ln7; 18:ln7; 18:ln9; 18:3n3; 20:5n3; 22:5n3; 22:6n3; 18:2n6; 18:3n6; 
20:3n6; and 20:4n6. 

25. The method of claim 22, wherein the lipid is a sterol selected from the group 

5 consisting of 5b-cholestan-3b-ol, 5a-cholestan-3b-ol, 5-cholesten~3b-ol, 5,24-cholestadien-3b-ol, 5- 
cholestan-25a-methyl-3b-ol, 5-cholestan-24b-methyl-3b-ol, 5-cholesten-24b-ethyl-3b-ol, and 5,22- 
cholestadien-24b-ethyl-3b-ol. 

26. The method of claim 22, wherein the lipid class is selected from the group 
consisting of lyso-phosphatidylcholine, sphingomyelin, phosphatidylcholine, phosphatidylserine, 

10 phosphatidylinositol, phosphatidylethanolamine, cardiolipin, free fatty acids, monoacylglycerides, 
diacylglycerides, triacylgiycerides, and cholesterol esters. 

27. The method of claim 23, wherein the internal standard is selected from the group 
consisting of diheptadecanoyl phosphatidylcholine, dipentadecaenoyl phosphatidylethanolamine, 
tetraheptadecenoyl cardiolipin, diheptadecenoyl phosphatidylserine, pentadecenoyl sphingomyelin, 

15 heptadecanoyl lyso-phosphatidylcholine, tripheptadecaenoyl glyceride, pentadecaenoic acid, 
heptadecanoic cholesterol ester and free fucosterol. 

28. The method of claim 23, wherein the internal standard is heptadecanoic 1- 
heptadecanoyl-2-lyso-phosphatidycholine for the lipid class of lysophospholipids, N-pentadecenoyl- 
D-erythro-sphingosylphorylcholine for the lipid class of sphingomyelin, 1,2 

20 diheptadecanoylphosphatidylcholine for the lipid class of phosphatidylcholine, 1,2- 

diheptadecenoylphosphatidylethanolamine for the lipid class of phosphatidylethanolamine, 1,2- 
diheptadecenoylphosphatidylserine for the lipid class of phosphatidylserine, pentadecaenoic acid for 
the lipid class of free fatty acids, triheptadecaenoic acid for the lipid class of triacylgiycerides, 
l,r,2,2-tetraheptadecaenoyl cardiolipin for the lipid class of cardiolipin, cholesteryl heptadecanoate 

25 for the lipid class of cholesterol esters and stigmasterol for the lipid class of free sterols. 

29. A method comprising permitting for a fee access to the database of claim 1. 

30. The method of claim 29, wherein access is through a computer interface. 

31. A method of generating quantitative lipomic data, comprising: 

separating a biological sample into fractions based on a plurality of lipid classes, wherein at 
30 least one quantitative internal standard is included for each lipid class; and 

measuring the quantity of a plurality of lipid metabolites in the fractions. 

32. The method of claim 31, wherein the plurality of lipid classes comprises lyso- 
phosphatidylcholines, sphingomyelins, phosphatidylcholines, phosphatidylserines, 
phosphatidylinositols, phosphatidylethanolamines, cardiolipins, free fatty acids, monoacylglycerides, 

35 diacylglycerides, triacylgiycerides, or cholesterol esters. 

33. The method of claim 3 1 , wherein the plurality of lipid metabolites comprises at 
least one of tetradecanoic acid, pentadecanoic acid, hexadecanoic acid, heptadecanoic acid, 
octadecanoic acid, eicosanoic acid, docosanoic acid, tetracosanoic acid, 9-tetradecenoic acid, 9- 
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hexadecenoic acid, 11-octadecenoic acid, 9-octadecenoic acid, 11-eicosenoic acid, 5,8,11- 
eicosatrienoic acid, 13-docosenoic acid, 15-tetracosenoic acid, 9,12,15-octadecatrienoic acid, 
6,9,12,15-octadecatetraenoic acid, 11,14,17-eicosatrienoic acid, 8,11,14,17-eicosictetraenoic acid, 
5,8,1 1,14,17-eicosapentaenoic acid, 7,10,13,16,19-docosapentaenoic acid, 4,7,10,13,16,19- 
5 docosahexaenoic acid, 6,9,12,15,18,21-tetracoshexaenoic acid, 9, 12-octadecadienoic acid, 6,9,12- 
octadecatrienoic acid, 11,14-eicosadienoic acid, 8,11,14-eicosatrienoic acid, 5,8,11,14- 
eicosicatetraenoic acid, 13,16-docsadienoic acid, 7,10,13,16-docosicatetraenoic acid, 4,7,10,13,16- 
docosapentaenoic acid, 9-trans-hexadecenoic acid, 9-trans-octadecenoic acid, 8-eicosaenoic acid, 5- 
eicosaenoic acid, piasmalogen fatty acids, 5b-cholestan-3b-ol, 5a-cholestan-3b-ol, 5-cholesten-3b-ol, 
10 5,24-choIestadien-3b-ol, 5-cholestan-25a-methyI-3b-ol, 5-cholestan-24b-methyl-3b-ol, 5-cholesten- 
24b-ethyl-3b-ol, or 5,22-cholestadien-24b-ethyl-3b-oI, each as a compound or a component of a lipid 
molecule. 

34. The method of claim 3 1 , wherein separating comprises chromatography. 

35. The method of claim 31, wherein measuring comprises chromatography. 

15 36. A method for presenting analysis of a plurality of individual quantitative lipid 

metabolite profiles, comprising: 

designating the plurality of individual quantitative metabolite profiles; 
identifying at least one difference or at least one similarity in a metabolite in the plurality of 
individual quantitative metabolite profiles; and 
20 displaying at least one difference or at least one similarity in the metabolite in the plurality 

of individual quantitative metabolite profiles. 

37. The method of claim 36, wherein displaying generates a web page for viewing. 

38. The method of claim 37, wherein the web page comprises a representation of a heat 

map. 

25 39. The method of claim 37, wherein the web page comprises a representation of a 

targeting chart. 

40. A method of determining a metabolic effect of a condition, comprising 
subjecting a subject to the condition; 

taking a biological sample from the subject; 
30 analyzing the biological sample to produce a test lipomic profile for the subject; 

comparing the test lipomic profile for the subject with a control lipomic profile; and 
drawing conclusions about the metabolic effect of the condition based on differences or 
similarities between the test lipomic profile and the control lipomic profile. 

41. The method of claim 40, wherein the condition is a genotype. 

35 42. The method of claim 4 1 , wherein the genotype comprises a genetic knockout. 

43. The method of claim 40, wherein the condition comprises a dietary limitation or 
supplementation. 

44. The method of claim 40, wherein the condition comprises a disease or disease state. 
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45. The method of claim 40, wherein the condition comprises application of a toxin or 
suspected toxin. 

46. The method of claim 40, wherein the condition comprises application of a 
pharmaceutical agent or candidate agent. 

5 47. The method of claim 40, wherein the control lipomic profile is a compiled lipomic 

profile assembled from a plurality of individual lipomic profiles. 

48. The method of claim 40, wherein the control lipomic profile is a pre-condition 
lipomic profile from the subject 

49. The method of claim 40, which method is a method of determining drug or 
1 0 treatment effectiveness, comprising 

applying a drug or treatment to a subject; 
taking a biological sample from the subject; 

analyzing the biological sample to produce a test lipomic profile for the subject; 
comparing the test lipomic profile for the subject with a control lipomic profile; and 
15 drawing conclusions about the effectiveness of the drug or treatment based on differences or 

similarities between the test lipomic profile and the control lipomic profile. 

50. The method of claim 49, wherein the drug or treatment is a hormone or hormone 
treatment. 

51. The method of claim 49, wherein the drug or treatment influences obesity or 

20 diabetes. 

52. The method of claim 40, which method is a method of determining likelihood of 
success of a treatment or procedure, comprising 

subjecting a subject to the treatment or procedure; 
taking a biological sample from the subject; 
25 analyzing the biological sample to produce a test lipomic profile for the subject; 

comparing the test lipomic profile for the subject with a control lipomic profile; and 
drawing conclusions about the likelihood of success of a treatment or procedure based on 
differences or similarities between the test lipomic profile and the control lipomic profile. 

53. The method of claim 52 wherein the treatment or procedure comprises an organ 
30 transplant. 

54. The method of claim 52, wherein the treatment or procedure comprises a dietary 
limitation or supplementation. 

55. The method of claim 52, wherein the treatment or procedure comprises application 
of a pharmaceutical agent or candidate agent. 

35 56. A method for providing metabolic information comprising providing electronic 

access to the database of claim 40. 

57. The method of claim 56, wherein the electronic access comprises access through 
the internet. 
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58. A method of determining the metabolic effect of an agent comprising 
obtaining a quantified metabolic profile from a biological sample treated with or 

without an agent, wherein the quantified metabolic profile comprises a quantified measurement of a 
metabolite and wherein an increase or decrease of a quantified measurement of a metabolite caused 
5 by the agent is indicative of a metabolic effect of the agent. 

59. The method of claim 58, wherein the agent is a therapeutic agent or a candidate 
therapeutic agent. 

60. A method of generating a disease condition-linked quantified metabolic profile 
comprising 

10 obtaining a first quantified metabolic profile from a first biological sample from a 

first individual having a disease condition and a second quantified metabolic profile from a second 

biological sample from a second individual of a normal condition, and 

comparing the first quantified metabolic profile with the second quantified 

metabolic profile, wherein a disease condition-linked quantified metabolic profile comprises a 
15 variation of a quantified measurement of a metabolite between the first and second quantified 

metabolic profiles. 

61. A method of diagnosing a disease condition or predisposition thereto of a subject 
comprising 

generating a disease condition-linked quantified metabolic profile according to the 
20 method of claim 60, and 

obtaining a subject quantified metabolic profile from a biological sample of a 
subject, wherein a subject quantified metabolic profile identical or substantially similar to the disease 
condition-linked quantified metabolic profile is indicative of the disease condition or the 
predisposition thereto. 

25 62. A method of using a quantitative lipomic database in disease diagnosis, prognosis, 

or prediction, comprising screening the lipomic database for a lipid metabolite profile that is linked to 
the disease. 

63. A method of screening for a compound useful in treating, reducing, or preventing a 
disease or progression of a disease, comprising: 
30 determining if application of a test compound alters a disease-related lipid metabolite profile 

so that the profile less closely resembles a disease-linked profile than it did prior to such treatment; 
and 

selecting a compound that so alters the disease-related lipid metabolite profile, 
wherein the disease-related lipid metabolite profile includes a level of at least one of the following 
35 metabolites: tetradecanoic acid, pentadecanoic acid, hexadecanoic acid, heptadecanoic acid, 

octadecanoic acid, eicosanoic acid, docosanoic acid, tetracosanoic acid, 9-tetradecenoic acid, 9- 
hexadecenoic acid, 1 1-octadecenoic acid, 9-octadecenoic acid, 11-eicosenoic acid, 5,8,11- 
eicosatrienoic acid, 13-docosenoic acid, 15-tetracosenoic acid, 9,12,15-octadecatrienoic acid, 
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6,9,12,15-octadecatetraenoic acid, 11,14,17-eicosatrienoic acid, 8,11,14,17-eicosictetraenoic acid, 
5,8,1 1,14,17-eicosapentaenoic acid, 7,10,13,16,19-docosapentaenoic acid, 4,7,10,13,16,19- 
docosahexaenoic acid, 6,9,12,15,18,21-tetracoshexaenoic acid, 9,12-octadecadienoic acid, 6,9,12- 
octadecatrienoic acid, 11,14-eicosadienoic acid, 8,11,14-eicosatrienoic acid, 5,8,11,14- 
5 eicosicatetraenoic acid, 13,16-docsadienoic acid, 7,10,13,16-docosicatetraenoic acid, 4,7,10,13,16- 
docosapentaenoic acid, 9-trans-hexadecenoic acid, 9-trans-octadecenoic acid, 8-eicosaenoic acid, 5- 
eicosaenoic acid, plasmalogen fatty acids, 5b-cholestan-3b-ol, 5a-cholestan-3b-ol, 5-cholesten-3b-ol, 
5,24-cholestadien-3b-ol, 5-cholestan-25a-methyl-3b-ol, 5-cholestan-24b-methyl-3b-ol, 5-cholesten- 
24b-ethyl-3b-ol, or 5,22-cholestadien-24b-ethyl-3b-ol, each as a free compound or a component of a 
10 lipid molecule. 

64. A method for screening for an agent having an effect on a disease condition, 
comprising: 

obtaining a first quantified metabolic profile from a first biological sample from an 
individual having a disease condition and treated with a test agent, and 
15 comparing the first quantified metabolic profile with a disease condition-linked 

quantified metabolic profile generated according to the method of claim 60, wherein a change in the 
first quantified metabolic profile caused by the test agent and associated with the disease condition- 
linked quantified metabolic profile is indicative that the test agent has an effect on the disease 
condition. 

20 65. The method of claim 64, wherein the disease condition comprises a genotype, a 

dietary limitation or supplementation, a disease or disease state, a treatment with a compound, or a 
combination of two or more thereof. 

66. A method of identifying a therapeutic target for a disease condition comprising 
generating a disease condition-linked quantified metabolic profile according to the method 

25 of claim 60, wherein a variation of a quantified measurement of a metabolite is indicative of the 
metabolite as a therapeutic target for the disease condition. 

67. The method of any one of claims 1 through 66, further comprising generating a 
printed report. 

68. A database generated according to the method of claim 1, the database comprising 
30 a profile table including a quantified metabolic profile from a biological sample 

from an individual having a condition, wherein the quantified metabolic profile comprises a 
quantified measurement of a metabolite and wherein the quantified measurement is obtained using an 
internal standard for the metabolite so that the quantified measurement is integratable into a database. 

69. The database of claim 68, wherein each quantified metabolic profile comprises a 
35 quantified measurement of a plurality of metabolites. 

70. The database of claim 68, wherein at least one of the biological samples comprises 
a biological fluid or tissue sample. 
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71. The database of claim 68, wherein the biological samples are selected from 
individual subjects or archival repositories or some of both. 

72. The database of claim 68, wherein at least one of the biological samples is obtained 
from an animal model. 

5 73. The database of claim 68, wherein at least one of the biological samples is taken 

from an animal. 

74. The database of claim 73, wherein the animal is a human, mouse, rat, monkey, 
chicken, cat, dog, pig, horse, cow, fruit fly, or worm.. 

75. The database of claim 73, wherein the animal is a mammal. 

10 76. The database of claim 68, wherein the biological sample is obtained from an 

organism that is subject to a condition. 

77. The database of claim 76, wherein the condition is a trait of the organism from 
which the biological sample is obtained. 

78. The database of claim 76, wherein the condition is a genotype. 

15 79. The database of claim 78, wherein the genotype comprises a genetic knockout. 

80. The database of claim 76, wherein the condition comprises a dietary limitation or 
supplementation. 

81. The database of claim 78, wherein the condition comprises a disease or disease 

state. 

20 82. The database of claim 78, wherein the condition comprises application of a toxin or 

suspected toxin. 

83. The database of claim 78, wherein the condition comprises application of a 
pharmaceutical or therapeutic agent or candidate agent to the organism. 

84. The database of claim 78, wherein the condition comprises an increase in exercise, 
25 a decrease in exercise, or a change in an exercise regimen of the subject. 

85. The database of claim 68, wherein the metabolite is selected from the group 
consisting of lipids, carbohydrates, vitamins, co-factors, and pigments. 

86. The database of claim 68, wherein the metabolite is a lipid metabolite. 

87. The database of claim 86, wherein the lipid metabolite is selected from the group 
30 consisting of tetradecanoic acid, pentadecanoic acid, hexadecanoic acid, heptadecanoic acid, 

octadecanoic acid, eicosanoic acid, docosanoic acid, tetracosanoic acid, 9-tetradecenoic acid, 9- 
hexadecenoic acid, 1 1-octadecenoic acid, 9-octadecenoic acid, 11-eicosenoic acid, 5,8,11- 
eicosatrienoic acid, 13-docosenoic acid, 15-tetracosenoic acid, 9,12,15-octadecatrienoic acid, 
6,9,12,15-octadecatetraenoic acid, 11,14,17-eicosatrienoic acid, 8,11,14,17-eicosictetraenoic acid, 
35 5,8,1 1,14,17-eicosapentaenoic acid, 7,10,13,16,19-docosapentaenoic acid, 4,7,10,13,16,19- 

docosahexaenoic acid, 6,9,12,15,18,21-tetracoshexaenoic acid, 9,12-octadecadienoic acid, 6,9,12- 
octadecatrienoic acid, 11,14-eicosadienoic acid, 8,11,14-eicosatrienoic acid, 5,8,11,14- 
eicosicatetraenoic acid, 13,16-docsadienoic acid, 7,10,1 3, 16-docosicatetraenoic acid, 4,7,10,13,16- 
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docosapentaenoic acid, 9-trans-hexadecenoic acid, 9-trans-octadecenoic acid, 8-eicosaenoic acid, 5- 
eicosaenoic acid, plasmalogen fatty acids, 5b-cholestan-3b-ol, 5a-cholestan-3b-ol, 5-cholesten-3b-ol, 
5,24-cholestadien-3b-ol, 5-cholestan-25a-methyl-3b-ol, 5-cholestan-24b-methyl-3b-ol, 5-cholesten- 
24b-ethyl-3b-ol, and 5,22-cholestadien-24b-ethyl-3b-ol, each as a compound or a component of a 
5 lipid molecule. 

88. The database of claim 86, wherein the quantified metabolic profile comprises a 
quantified measurement of a lipid in a lipid class. 

89. The database of claim 86 wherein the quantified measurement of a lipid in a lipid 
class is obtained using an internal standard for the lipid class. 

10 90. The database of claim 89, wherein the lipid is selected from the group consisting of 

fatty acid 16:0, 18:0, 16:ln7; 18:ln7; 18:ln9; 18:3n3; 20:5n3; 22:5n3; 22:6n3; 18:2n6; 18:3n6; 
20:3n6; and 20:4n6. 

91. The database of claim 89, wherein the lipid is a sterol selected from the group 
consisting of 5b-cholestan-3b-ol, 5a-cholestan-3b-ol, 5-choIesten-3b-ol, 5,24-cholestadien-3b-ol, 5- 

15 cholestan-25a-methyl-3b-ol, 5-cholestan-24b-methyl-3b-ol, 5-cholesten-24b-ethyl-3b-ol, and 5,22- 
cholestadien-24b-ethyl-3b-ol. 

92. The database of claim 89, wherein the lipid class is selected from the group 
consisting of lyso-phosphatidylcholine, sphingomyelin, phosphatidylcholine, phosphatidylserine, 
phosphatidylinositol, phosphatidylethanolamine, cardiolipin, free fatty acids, monoacylglycerides, 

20 diacylglycerides, triacylglycerides, and cholesterol esters. 

93. The database of claim 89, wherein the internal standard is selected from the group 
consisting of diheptadecanoyl phosphatidylcholine, dipentadecaenoyl phosphatidylethanolamine, 
tetraheptadecenoyl cardiolipin, diheptadecenoyl phosphatidylserine, pentadecenoyl sphingomyelin, 
heptadecanoyl lyso-phosphatidylcholine, tripheptadecaenoyl glyceride, pentadecaenoic acid, 

25 heptadecanoic cholesterol ester and free fucosterol. 

94. The database of claim 89, wherein the internal standard is heptadecanoic 1- 
heptadecanoyl-2-lyso-phosphatidycholine for the lipid class of lysophospholipids, N-pentadecenoyl- 
D-erythro-sphingosylphorylcholine for the lipid class of sphingomyelin, 1,2 
diheptadecanoylphosphatidylcholine for the lipid class of phosphatidylcholine, 1,2- 

30 diheptadecenoylphosphatidylethanolamine for the lipid class of phosphatidylethanolamine, 1,2- 

diheptadecenoylphosphatidylserine for the lipid class of phosphatidylserine, pentadecaenoic acid for 
the lipid class of free fatty acids, triheptadecaenoic acid for the lipid class of triacylglycerides, 
l,r,2,2'-tetraheptadecaenoyl cardiolipin for the lipid class of cardiolipin, cholesteryl heptadecanoate 
for the lipid class of cholesterol esters and stigmasterol for the lipid class of free sterols. 

35 95. A computer readable medium containing the database of claim 68. 

96. The computer-readable medium of claim 95, wherein the metabolomic database is a 
lipomic database, and wherein at least one record comprising quantitative data for a plurality of lipid 
metabolites from a biological sample. 
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97. The computer-readable medium of claim 95, wherein the database comprises 
records that comprise data from animal, plant, or microbial samples. 

98. The computer-readable medium of claim 95, wherein the animal samples comprises 
samples from humans, mice, rats, monkeys, chickens, cats, dogs, pigs, horses, cows, fruit flies, or 

5 worms. 

99. The database of claim 68 further comprising a sample item table including a sample 
record for the quantified metabolic profile, and a condition item table including a condition record for 
the quantified metabolic profile. 

100. The database of claim 99 further comprising a genomic item table including a 
10 genomic profile for the quantified metabolic profile. 

101 . The database of claim 99 further comprising an expression item table including a 
gene expression profile for the quantified metabolic profile. 

102. The database of claim 99 further comprising a protein item table including a 
proteomic profile for the quantified metabolic profile. 

15 103. The database of claim 99 further comprising a character item table including a 

character profile for the quantified metabolic profile. 

104. The database of claim 99 further comprising a filter item table including a filter of 
quantified metabolic profile for a desired condition. 

105. A user interface for operatively working with a processor to affect operation of the 
20 database of claim 99 comprising 

means for providing settings for selecting a set of samples, 
means for providing settings for selecting a set of conditions, 
means for providing settings for selecting a set of metabolites, and 
means for displaying quantified metabolic profiles corresponding to the selected 
25 samples and conditions, wherein each displayed quantified metabolic profile consists of the 
quantified measurements of the selected metabolites. 

106. The user interface of claim 105 further comprising a display area which displays 
the value of a quantified measurement of a metabolite within the quantified metabolic profiles of the 
selected samples and conditions. 

30 107. The user interface of claim 105 further comprising 

means for comparing quantified metabolic profiles corresponding to a first set of 
selected samples and conditions to the quantified metabolic profiles corresponding to a second set of 
selected samples and conditions, and 

means for displaying the comparison. 
35 108. The user interface of claim 105, the user interface comprising: 

for a plurality of metabolites, a presentation of an observed quantity of at least one 
metabolite for a first biological sample with respect to an observed quantity of the at least one 
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metabolite for a second biological sample, wherein the presentation is operable to accept a user 
indication that further information is desired with respect to a selected metabolite. 

109. A computer implemented method for operating a relational database comprising 
creating a profile table including a quantified metabolic profile from a biological 

5 sample from an individual having a condition, wherein the quantified metabolic profile comprises a 
quantified measurement of a metabolite and wherein the quantified measurement is obtained using an 
internal standard for the metabolite so that the quantified measurement is integratable into a database, 
creating a sample item table including a sample record for the quantified metabolic 

profile, 

10 creating a condition item table including a condition record for the quantified 

metabolic profile, and 

storing data in the profile table, the sample item table, and the condition item table, 
wherein each quantified metabolic profile corresponds to a sample record and a condition record. 

1 10. A computer system for analyzing quantitative lipid metabolomic information, 
15 comprising: 

a processor; and 

a storage medium storing a relational database accessible by the processor, wherein the 
storage medium has stored thereon: 

the relational database comprising: 
20 a first table including a plurality of records, wherein at least one of the records 

includes quantitative data for a plurality of lipid metabolites. 

111. A computer system for use with the method of claim 1 10, comprising 
a processor, and 

a storage medium storing a relational database accessible by the processor, wherein 
25 the storage medium having stored thereon a relational database comprising 

a profile table including a quantified metabolic profile from a biological sample of 
a condition, wherein the quantified metabolic profile comprises a quantified measurement of a 
metabolite and wherein the quantified measurement is obtained using an internal standard for the 
metabolite so that the quantified measurement is integratable into the relational database, 
30 a sample item table including a sample record for the quantified metabolic profile, 

and 

a condition item table including a condition record for the quantified metabolic 

profile. 

1 12. An internal standard composition for lipid analysis of a sample, the composition 
35 comprising a plurality of lipid species, wherein at least one lipid species comprises at least one 

monounsaturated fatty acid of formula N:lnR, wherein N is an odd integer equal to or larger than 
three, wherein R is any integer equal to or less than N-l , and wherein at least one of the plurality of 
lipid species is a free fatty acid, a sphingomyelin, a cardiolipin, a phosphatidylethanolamine, a 
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phosphatidic acid, a phosphytidylcholine, a phosphatidylserine, a phosphatidylinositol, a 
phosphatidylglycerol, a monoacylglyceride, a diacylglyceride, a triacylgiyceride, a sterol ester, or a 
lysophospholipid. 

1 13. The internal standard composition of claim 1 12, wherein each lipid species 
5 comprises at least one such monounsaturated fatty acid. 

1 14. The internal standard composition of claim 1 12, comprising at least one lipid 
species comprising at least one monounsaturated fatty acid wherein N is 3, 5, 7, 9, 11, 13, 15, 17, 19, 
21, 23, or 25. 

115. The internal standard composition of claim 1 14, comprising at least one lipid 
10 species comprising at least one monounsaturated fatty acid wherein N is 15, 17, or 19. 

1 16. The internal standard composition of claim 1 12, wherein each of the plurality of 
lipid species is representative of a specific lipid class. 

117. The internal standard composition of claim 1 16, wherein each of the plurality, of 
lipid species is present in the composition at a concentration equivalent to the concentration of a 

15 sample lipid species from the same lipid class as represented by that lipid species. 

118. The internal standard composition of claim 1 12, wherein at least one of the 
monounsaturated fatty acids is not present in the sample. 

119. The internal standard composition of claim 118, wherein each of the 
monounsaturated fatty acids is not present in the sample. 

20 120. The internal standard composition of claim 1 12, wherein at least one of the lipid 

species is a lysophospholipid, and the lysophospholipid has the formula l-acyl,2-lyso-M or l-lyso,2- 
acyl-M, and where M is phosphytidylcholine, phosphatidylserine, phosphatidylethanolamine, 
phosphatidylinositol, phosphatidylglycerol, or phosphatidic acid. 

121. The internal standard composition of claim 116, comprising at least three lipid 

25 species. 

122. The internal standard composition of claim 116, comprising at least four lipid 

species. 

123. The internal standard composition of claim 1 16, comprising at least five lipid 

species. 

30 124. The internal standard composition of claim 1 16, comprising at least eight lipid 

species. 

125. The internal standard composition of claim 1 16, comprising at least ten lipid 

species. 

126. The internal standard composition of claim 116, wherein each lipid class is lyso- 
35 phosphatidylcholines, sphingomyelins, phosphatidylcholines, phosphatidylserines, 

phosphatidylinositols, phosphatidylethanolamines, cardiolipins, free fatty acids, monoacylglycerides, 
diacylglycerides, triacylglycerides, cholesterol esters, phosphatidic acids, phosphatidylglycerols, 
CDP-diacylglycerols, lysocardiolipins, or lysophosphatidylethanolamines. 
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127. An internal standard for phosphatidylethanolamines, phosphatidic acids, 
phosphatidylcholines, phosphatidylserines, phosphatidylinositols, phosphatidylglycerols, 
diacylglycerides, or triacylglycerides, comprising a first fatty acid of formula N:0 in the sn-1 position 
and a second fatty acid of formula M:Y in the sn-2 position, where Y is an integer greater than 0. 
5 128. The internal standard of 127, which is an internal standard for 

phosphatidylethanolamines, wherein the internal standard comprises a phosphatidylethanolamine that 
comprises the first fatty acid and the second fatty acid. 

129. The internal standard of 127, which is an internal standard for phosphatidic acids, 
wherein the internal standard comprises a phosphatidic acid that comprises the first fatty acid and the 

10 second fatty acid. 

130. The internal standard of 127, which is an internal standard for 
phosphytidylcholines, wherein the internal standard comprises a phosphytidylcholine that comprises 
the first fatty acid and the second fatty acid. 

131. The internal standard of 127, which is an internal standard for phosphatidylserines, 
15 wherein the internal standard comprises a phosphatidylserine that comprises the first fatty acid and 

the second fatty acid. 

132. The internal standard of 127, which is an internal standard for 
phosphatidylinositols, wherein the internal standard comprises a phosphatidylinositol that comprises 
the first fatty acid and the second fatty acid. 

20 133. The internal standard of 127, which is an internal standard for 

phosphatidylglycerols, wherein the internal standard comprises a phosphatidylglycerol that comprises 
the first fatty acid and the second fatty acid. 

134. The internal standard of 127, which is an internal standard for diacylglycerides, 
wherein the internal standard comprises a diacylglyceride that comprises the first fatty acid and the 

25 second fatty acid. 

135. The internal standard of 127, which is an internal standard for triacylglycerides, 
wherein the internal standard comprises a triacylglyceride that comprises the first fatty acid and the 
second fatty acid. 

136. The internal standard of 135, wherein the triacylglyceride further comprises a third 
30 fatty acid that is different from the first fatty acid and the second fatty acid. 

137. An internal standard for triacylglycerides or cardiolipins, comprising a first fatty 
acid of formula N:X at a first position, a second fatty acid of formula M:Y at a second position, and a 
third fatty acid of formula 0:Z at a third position, wherein N:X, M:Y, and OrZ are different from 
each other. 

35 138. The internal standard of claim 137, wherein the first position is sn-1 and X is 0. 

139. The internal standard of claim 137, wherein at least Y or Z is 1. 

140. The internal standard of claim 138, wherein both Y and Z are 1. 
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141. The internal standard of claim 137, wherein N:X is 17:0, M:Y is 19:1, and 0:Z is 
19:1 and wherein the first position is sn-1, the second position is sn-2, and the third position is sn-3. 

142. The internal standard of claim 137, wherein N:X is 17:0, M:Y is 19:1, and 0:Z is 
19:2 and wherein the first position is sn-1, the second position is sn-2, and the third position is sn-3. 

5 143. The internal standard of claim 137, which is an internal standard for 

triacylglycerides, wherein the internal standard comprises a triacylglyceride that comprises the first 
fatty acid, the second fatty acid, and the third fatty acid. 

144. The internal standard of claim 137, which is an internal standard for cardiolipins, 
wherein the internal standard comprises a cardiolipin that comprises the first fatty acid, the second 

10 fatty acid, and the third fatty acid. 

145. The internal standard of claim 144, wherein the first position is sn-1, the second 
position is sn-2, and the third position is either sn-1 ' or sn-2'. 

146. The internal standard of claim 145, wherein the third position is sn-1', and X and Z 

are 0. 

15 141 ' An internal standard composition for lipid analysis of a sample, the composition 

comprising a plurality of lipid species, wherein at least one lipid species comprises at least one 
polyunsaturated fatty acid of formula N:lnR, wherein N is an even integer equal to or larger than six, 
wherein R is any integer equal to or less than N-l, and wherein the desaturations occur in positions 
different from the positions of desaturations in fatty acids present in the sample, and wherein at least 

20 one of the plurality of lipid species is a free fatty acid, a sphingomyelin, a cardiolipin, a 

phosphatidylethanolamine, a phosphatide acid, a phosphytidylcholine, a phosphatidylserine, a 
phosphatidylinositol, a phosphatidylglycerol, a monoacylglyceride, a diacytglyceride, a 
triacylglyceride, a sterol ester, or a lysophospholipid. 

148. The internal standard composition of claim 147, wherein each lipid species 
25 comprises at least one such polyunsaturated fatty acid. 

149. The internal standard composition of claim 147, comprising at least one lipid 
species comprising at least one monounsaturated fatty acid wherein N is 6, 8, 10, 12, 14, 16, 18, 20, 
22, 24, or 26. 

150. The internal standard composition of claim 149, comprising at least one lipid 
30 species comprising at least one monounsaturated fatty acid wherein N is 14, 16, 18, or 20. 

151. The internal standard composition of claim 147, wherein each of the plurality of 
lipid species represents a different lipid class. 

152. The internal standard composition of claim 151, wherein each of the plurality of 
lipid species is present in the composition at a concentration equivalent to the concentration of a 

35 sample lipid species from the same lipid class as represented by that lipid species. 

153 . The internal standard composition of claim 147, wherein at least one of the 
polyunsaturated fatty acids is not present in the sample. 
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154. The internal standard composition of claim 153, wherein each of the 
polyunsaturated fatty acids is not present in the sample. 

155. The internal standard composition of claim 147, wherein at least one of the lipid 
species is a lysophospholipid, and the lysophospholipid has the formula l-acyl,2-lyso-M or l-lyso,2- 

5 acyl-M, and where M is phosphytidylcholine, phosphatidylserine, phosphatidylethanolamine, 
phosphatidylinositol, phosphatidylglycerol, or phosphatidic acid. 

156. The internal standard composition of claim 15 1 , comprising at least three lipid 

species. 

157. The internal standard composition of claim 151, comprising at least four lipid 

10 species. 

158. The internal standard composition of claim 15 1 , comprising at least five lipid 

species. 

159. The internal standard composition of claim 151, comprising at least eight lipid 

species. 

15 160. The internal standard composition of claim 151, comprising at least ten lipid 

species. 

161. The internal standard composition of claim 151, wherein each lipid class is lyso- 
phosphatidylcholines, sphingomyelins, phosphatidylcholines, phosphatidylserines, 
phosphatidylinositois, phosphatidylethanolamines, cardiolipins, free fatty acids, monoacylglycerides, 

20 diacylglycerides, triacylglycerides, cholesterol esters, phosphatidic acids, phosphatidylglycerols, 
CDP-diacylglycerols, lysocardiolipins, or lysophosphatidylethanolamines. 

162. An internal standard composition for lipid analysis of a sample, comprising at least 
four lipid species each comprising one or more fatty acid constituents, wherein the lipid species are 
selected from the group consisting of free fatty acid, a sphingomyelin, a cardiolipin, a 

25 phosphatidylethanolamine, a phosphatidic acid, a phosphytidylcholine, a phosphatidylserine, a 
phosphatidylinositol, a phosphatidylglycerol, a monoacylglyceride, a diacylglyceride, a 
triacylglyceride, a sterol ester, and a lysophospholipid, and wherein each fatty acid constituent of 
each lipid species is a saturated fatty acid having the formula N:0, wherein N is an odd integer equal 
to or larger than three. 

30 163. A method of generating quantitative lipomic data, comprising: 

separating a biological sample into fractions based on a plurality of lipid classes, wherein at 
least one quantitative internal standard is included for each lipid class for which quantitative data is 
desired, wherein the quantitative internal standard is the internal standard of any one of claims 112 
through 162; and 

35 measuring the quantity of a plurality of lipid metabolites in the fractions. 
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INTERNATIONAL SEARCH REPORT 



International application No. 
PCT/US02/21426 



Box I Observations where certain claims were found unsearchable (Continuation of Item 1 of first sheet) 
This international report has not been established in respect of certain claims under Article 17(2)(a) for the following reasons: 

1. Claim Nos.: 

because they relate to subject matter not required to be searched by this Authority, namely: 



2 - LZj Claim Nos.: 

because they relate to parts of the international application that do not comply with the prescribed requirements to 
such an extent that no meaningful international search can be carried out, specifically: 



3 - [ZD Claim Nos.: 

because they are dependent claims and are not drafted in accordance with the second and third sentences of Rule 
6.4(a). 

Box n Observations where unity of invention is lacking (Continuation of Item 2 of first sheet) 

This international Searching Authority found multiple inventions in this international application, as follows: 
Please See Continuation Sheet 



1 . | | As all required additional search fees were timely paid by the applicant, this international search report covers all 

searchable claims. 

2 - E] A 5 all searchable claims could be searched without effort justifying an additional fee, this Authority did not invite 
^ payment of any additional fee. 

3. I I As only some of the required additional search fees were timely paid by the applicant, this international search 
report covers only those claims for which fees were paid, specifically claims Nos.: 



4. EZI No required additional search fees were timely paid by the applicant. Consequently, this international search report 
is restricted to the invention First mentioned in the claims; it is covered by claims Nos.: 

Remark on Protest j__ J The additional search fees were accompanied by the applicant's protest. 

[_ I No protest accompanied the payment of additional search fees. 



Form PCT/ISA/210 (continuation of first sheet(l)) (July 1998) 



INTERNATIONAL SEARCH REPORT 



PCT/US02/21426 



BOX H. OBSERVATIONS WHERE UNITY OF INVENTION IS LACKING 

This application contains the following inventions or groups of inventions which are not so linked as to form a single general 
inventive concept under PCT Rule 13.1. In order for all inventions to be searched, the appropriate additional search fees must be 
paid. 

Group I, claim(s) 1-30 and 67-104, drawn to a method of creating a database and the database created. 
Group II, claim(s) 31-39 and 67, drawn to a method of generating quantitative lipomic data. 
Group III, claim(s) 40-57 and 67, drawn to a method of determining the metabolic effect of a condition. 
Group IV, claim(s) 58-60 and 67, drawn to a method of determining the metabolic effect of an agent. 
Group V, claim(s) 61 and 67, drawn to a method of diagnosing a disease or prediposition thereto. 
Group VI, claim(s) 62 and 67, drawn to a method of using a quantitative lipomic database. 

Group VII, claim(s) 63 and 67, drawn to a method of screening a compound for use in treating, preventing, or reducing a disease or 
progression thereof. 

Group VIII, claim(s) 64-65 and 67, drawn to a method of screening for an agent having an effect on a disease condition. 

Group IX, claim(s) 66-67, drawn to a method of identifying a therapeutic target for a disease. 

Group X, claim(s) 105-108, drawn to a user interface for working with a processor. 

Group XI, claim(s) 109, drawn to a computer implemented method for operating a relational database. 

Group XII, claim(s) 110-111, drawn to a computer system for analyzing lipid metabolomic information. 

Group XIII, claim(s) 1 12-126, drawn to an internal lipid standard composition. 

Group XIV, claim(s) 127-136, drawn to an internal standard for specific phospholipids. 

Group XV, claim(s) 137-146, drawn to an internal standard for triacylglycerides or cardioiipins. 

Group XVI, claim(s) 147-162, drawn to an internal standard composition for analysis of lipids. 

Group XVII, claim(s) 163, drawn to a method of generating quantitative lipomic data. 



The inventions listed as Groups I-XVII do not relate to a single general inventive concept under PCT Rule 13.1 because, under PCT 
Rule 13.2, they lack the same or corresponding special technical features for the following reasons: a method of generating a database 
is known in the art, as are methods of generating lipomic data, and various lipid compositions which may be used as " internal n 
standards. See the Search Report. As at least some of the claimed methods and compositions are known in the art, the claims do not 
recite a special technical feature and are not linked by a general inventive concept. 



FormPCT/ISA/210 (second sheet) (July 1998) 



INTERNATIONAL SEARCH REPORT 



PCT/US02/21426 



Continuation of B. FIELDS SEARCHED Item 3: 

EAST, BIOSIS, HCAPLUS, MEDLINE, COMPUSCIENCE, BIOTECHDS, BIOTECHNO 

terms: metaboli?, profile, lipid or lipoi? or lipomic, genom? or genotype or gene or genetic, obes? or diabet?, transplant 



Form PCT/ISA/210 (second sheet) (July 1998) 



